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PAY UP TO 
72% LESS 
FOR YOUR 
BUSINESS 
INTELLIGENCE 


Dell and Microsoft can help reduce the cost and complexity of Business 
Intelligence. Our systems are quick to install, easy to manage, and built 

with standard components that work with what you already use — all for 
up to 72% less per terabyte than the competition? 


SIMPLIFY YOUR IT AT DELL.COM/SQL N 


*72% claim based upon a comparison of list prices of typical Business Intelligence offerings from leading hardware manufacturers versus 
Dell/Microsoft combined offerings. Benchmarked systems configured with 4-5 TB of data storage, database application software, and 
Business Intelligence analytic software. Dell is a trademark of Dell Inc. ©2007 Dell Inc. All rights reserved. 


Number of silos in which your customer data resides. 
Coincidentally, also the number of totally different 
definitions you have for "customer? 


definition — 
Embarcadero solutions help manage data 
enterprise-wide, enabling you to build and 
communicate an architecture of your data assets 
so you always know what is what, and where it is. 
With just one, everything gets simpler. You and 
your business are empowered to use and reuse 
reliable, relevant, and timely data across a range of 
platforms and storage formats. Understanding your 
data is finally possible, and getting a hold of the 
metadata is the key. Standards can be defined and 
enforced. Data redundancy is eliminated. It's no 
coincidence that the most sophisticated solution is 


also the simplest. See how one gets it done. 


Call 415.834.3131 or visit www.embarcadero.com 
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Q Editor's Note 


In the cover 

story, Megan and 

Jeff invite you to 
comment online at 
InstantDoc ID 97289 about 
your experiences with $QL 
Server products, but 

the online article isn't the 
only place to join the 
discussion. Share your 
trials and triumphs with 
SQL Server tools in the 
Tool Time with Kevin Kline 


forum at www.sglmag 


.com/go/tooltime and the 


3rd Party Products forum 


at www.sqlmag.com/go/ 


forum/3rdpartyproducts. 
—Christan Humphries, 


| jon. edit 
TS or 


18 — Megan Bearly and Jeff James 
Learn which Microsoft and third-party products will have the tools you 
need to stay on top of SQL Server development, administration, and 


business intelligence (BI) challenges in the coming year. 
InstantDoc ID 97289 


FEATURES 


23 SQL Server Read- 
Consistency Problems, 
Part 1 


—Itzik Ben-Gan 

This two-part series promises to shed some 
light on the way reads behave in different 
scenarios, as well as the consistency level you 
should expect in each scenario. 

InstantDoc ID_97278 


33 Running SQL Server їп 
a Virtual Environment 


—Alan Sugano 
Optimize server performance with these 


InstantDoc ID 95799 


38 Working with 
Estimated Query 
Execution Plans. і 
—Tyler Chessman ۰ ә 
If you want to be able to s complexity 
and size of queries, consider implementing 
estimated execution plans in your ү” 


2] Managing Plan-Cache 
Size 
—Kalen Delaney 
SQL Server 2005 doesn't limit plan-cache size 
but still provides ways to control it. Learn how 


to keep an eye on cache size and tricks to trim InstantDoc ID 97267 ۷ | 
down а bloated cache to improve performance. 0 
InstantDoc ID 97373 ۳ 
eo 41 Sharpen Your Basic 
SQL Server Skills 


31 Data Warehousing: 
Dimensional Role- 
Playing 
—Michelle A. Poolet 
Dimensional roles can play many different 
parts in an accumulating snapshot fact table. 
Find out how dimensional role-playing can 


—— — — benefitvourstaesehenta-———. 
InstantDoc ID 97272 


—Pinalkumar Dave 
Learn the difference between FI 
TOR=0 and FILLFACTOR=100; JLI 
and NULLIF; and % used asa wildcard ~ 
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With HP PolyServe Software, IT 
organizations can improve SQL Server 
management while attacking the 

costs associated with large database 
deployments. Customers can: 


* Dramatically reduce server count 


* Slash cluster configuration and 
management time 


* Virtually eliminate the cost of high 
availability 


e Realize significantly lower total cost of 
ownership for SQL Server 


"With HP PolyServe, manual 
build time has been automated 
from days to hours and failover 
time has been reduced fivefold. 
System management is much 
more efficient." 


- David Miller, global solution 
director, Avanade Inc. 


For SQL Server consolidation, 
there's only one solution you 
need to know. 


HP PolyServe Software for Microsoft? SQL Server" is the only 
software solution designed specifically for consolidating your 
mission-critical SQL Server databases. HP PolyServe Software's 
Dynamic Rehosting feature provides instance mobility, optimizing 
capacity allocation and simplifying infrastructure administration. 
For IT administrators, HP PolyServe Software dramatically 
increases control over essential tasks, such as: 


Reducing costs: Consolidate to a dramatically smaller server and storage 
footprint by stacking multiple default and named instances onto fewer servers 


Expanding capacity: Easily add new servers or storage, and automatically 
provision new instances, in seconds 


Reallocating resources: Promote a SQL Server instance to a larger server 
in less than 30 seconds 


Maximizing uptime: Simply, easily configure high availability services for 
all SQL Server instances 


Minimizing maintenance windows: Perform rolling patches and 
updates while minimizing instance offline time to seconds 


Visit www.hp.com/go/polyserve 
Call 1-877-476-5973 


Technology for better business outcomes. 


Microsoft is a registered trademark of Microsoft Corporation, and SQL Server is a trademark of Microsoft Corporation. The information herein is subject to change without notice. © 2007 Hewlett-Packard Development Company, L.P. 
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VMs vs. Multiple 
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Community Dialog: 
Want to talk SQL Server? 
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43 Industry News: 
Bytes from the Blog 

Jeff James shares insights from his 
visit to Denver to attend the 2007 
PASS Community Summit. 
InstantDoc ID 97215 


44 Product Review: 
PolyServe Database 
Utility for SQL Server 
—Ron Talmage 
HP PolyServe Database Utility for 
SQL Server offers simplified consoli- 
dation and failover for SQL Server, 
but still requires careful planning 


InstantDoc ID 97338 


45 New Products 
Check out new and improved 


SQL Server-related products. 
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Altova® XMLSpy® 2008 — The industry standard XML development environment. 


ALTOVA 
2۲۲۳۱۱5 ۳۲ 


Immerse yourself їп 


XML intelligence j 


Dive into XMLSpy® 2007, and take 
XML development to greater depths. 


Spied in Version 2008: 

e Support for Microsoft® Word, Excel, and now PowerPoint Open XML formats 
* Direct editing of database content via database query window 

e XPath evaluation across multiple files at once 

* Support for XInclude and XPointer 


Altova? XMLSpy, the industry standard ) | 
XML development environment, is vital 

for modeling, editing, transforming, : 

and debugging XML applications. 

Breathe life into your plans 
with the world’s leading XML жү 

editor, the original graphical ۲ Ж А 

schema designer, а Cod es p 75 = y А A E 
generator, file converters; ү As se 
debuggers, profilers, full << 

database integration, suppc 
for XSLT, XQuery, WSDL, SOAI 
and a trove of treasured XML 
utilities and usability aides. 
Become a markup mastermind! - 2 


Download XMLSpy 200840985: 
www.altova.com E 


XMLSpy is also available as part of в 
the value-packed Altova MissionKit" ш 
software bundle. А 
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EDITORIAL 


Michael Otey 
motey@ sqlmag.com) is technical director for 


Windows IT Pro and SQL Server Magazine and 
coauthor of SQL Server 2005 Developers Guide 
(Osborne/McGraw-Hill). 


Q Editors Tip: 


Do you need help deciding 
how to consolidate your 
servers? Do you have a 
solution that could help 
others? Read your peers' 
consolidation recommenda- 
tions and tips and post 
your own opinion at www 
sqlmag.com/go/forumtopic/ 
consolidation. 
~~ Christan Humphries, 
production editor 


i= 


Christan 
Humphries 


(chumphries@ penton.com) is production 
editor for Windows IT Pro and SQL 
Server Magazine. 
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VMs vs. Multiple 
SQL Server Instances 


physical hardware and, aside from whatever work- 
load might be generated by the queries running on 
the server instance, there is no additional overhead. 

Virtualization and server instances are similar in 
their administrative aspects, but running multiple 
SQL server instances has a couple of advantages. Be- 
cause there’s only one installation of the OS and the 
server software, server instances require less patch- 
ing. With virtualization, each VM has its own guest 
OS and server products, which must be managed and 
patched. In both solutions, hands-on server manage- 
ment tasks are performed using Enterprise Manager 
or SQL Server Management Studio. 

In terms of deployment and recovery, virtualiza- 
tion has an edge on its competition. With virtualiza- 
tion, you can build base images that can be deployed 
to a new server in a matter of minutes. Then, a VM 
image can be backed up or replicated to a remote site 
and, in the event of a failure, could be brought in-line 
in minutes or even seconds. 

Another factor to consider is licensing. Multiple 
SQL server instances don’t require any additional 
SQL Server licenses. For virtualization, each SQL 
Server system that’s running in a guest VM must be 
licensed—unless you're running SQL Server 2005 En- 
terprise Edition, which provides unlimited VM usage. 

The bottom line is that both methods can be effec- 
tive solutions for server consolidation. To help you de- 
cide which solution is right for you, see the resources 
listed in this month’s Your Savvy Assistant. But remem- 
ber, just because virtualization is today’s hot technology 
doesn’t mean that it’s the only game in town. SQL 

InstantDoc ID 97439 


InstantDoc ID 93208 а. 


Itiple SQL Server Instances 


erver: Managing Multiple 
Instances,” InstantDoc ID 9021 

“An Instance of Understanding,” InstantDoc 
ID 93187 


For more resources, see my extended blog post at 
www.Windowsitpro.com/go/SavvyAssistant. 


М irtualization is опе of the most rapidly growing 
technologies in IT today. Driving this growth is 
virtualization’s ability to decouple the server from the 
underlying hardware, letting you run multiple virtual 
machines (VMs) on a single hardware platform and 
easily move the VMs (along with their underlying 
guest OS and servers) to different hardware platforms. 
Many organizations have adopted this technology as 
a server consolidation method. But virtualization isn’t 
the only way to consolidate servers. Every release since 
SQL Server 2000 can support multiple SQL Server in- 
stances on the same system—unlike other Microsoft 
server products. 

Multiple server instance support in SQL Server 
essentially lets you run multiple occurrences of SQL 
Server’s database engine on the same server. SQL 
Server 2005 Enterprise Edition supports up to 50 
server instances. ISPs and other hosting scenarios 
often use multiple SQL server instances so that mul- 
tiple clients can each have control of their own SQL 
Server database server without a dedicated hardware 
platform. So how do virtualization and multiple SQL 
server instances compare? 

The first, and in many cases the most important, 
factor to consider is performance. Because of their 
nature, virtualized servers don’t provide the same 
level of performance as a physical machine because 
the virtual environment creates additional overhead 
that takes away from performance. Most estimates 
put VM overhead at about 10-15 percent. In a server 
consolidation environment, the impact will be even 
greater because of the workloads supported by the 
other VMs. In contrast, server instances run on the 
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solidation (as he often does), but | made‏ 
him keep it short to make room for my picture.@)‏ 
Luckily, we don't have to squeeze all the informa-‏ 
tion onto this tiny page. You can learn more about‏ 


server consolidation options an deciding 
which is the best solutio ou from the 0 


Virtualization 
Format War,” InstantDoc ID 9; 
"Market Watch: 7 ی‎ Products,” 
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Introducing the SQL Server Magazine! 


Easy-to-find icons in print carry-through to the web for dig- 
ging in and getting the in-depth answers and copyable code you're looking for—at a 


light-speed pace!‏ و 
SQL SERVER With new interactive features like, “Test Your Skills"‏ 
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DPM 2007 Pricing 

This tabla {Eom http? Server License ۵ 
تب‎ cama FRONTE 
sy:temcenter/'dpm./ Data Protection 

howtobuwyy default. mspx) Manager Server 

shows the estimated. prices 2007 


Package that Includes (1) Server 
license for application and file 
server management, and (23 


$1,269 U.S. 
iretall price) 


available at press time for 
the initial release of DPM 
2007. The Enterprise 


Data Protection 


Manager Server 
2007 with 1 


Enterprise server ML far Backup 
and recovery management of ap- 
plication Including Micresoft SOL 
Server, Microsoft Exchange Server, 
and Microsoft Offlce SharePotnt 
Server, 


Enterprise Server 
ML (avallable In 
retall channel 


anty) 


Data Protection Manager Management License” (ML) Offerings 


Backup and recovery manage- 
ment of applications Including 
Microsoft SOL Server, Microsoft 
Exchange Server and Microsoft 
Office SharePoint Server, Also 
Includes the Microsoft System 
Recovery Tool DPA to СРМ 
replication, and host-based 
virtual server backup 
functionality. 


Backup and recovery manage- 
ment of flle servers, 


Enterprise $426 U.S, 
Server ML 


Server license, as noted 
in the chart, is required 
for using DPM with the 
Microsoft key workloads. 


The Dell SQL Server Story 


For users of SOT Server, Dell Computers 

has had a compelling story Dell server hard- 
ware has twice been selected as "SQL Server 
Magazine Readers" Choice" for best server 
hardware, as well as reader” choice for Hest 
High Availability Solution, Best MAS Solution 
and Best SAM Solution. This bas come to pass 
in part because Dell is an ОЕМ of SCIL Server 
and hæ been building servers and integrated 
solutions for the SOL Server market that can 
deliver a soup-to-outs SL Server installation 
tn its customers. 


Dell tests its supported SCIT. Server erviran- 
ments in its own labs and offers mot just SCIL. 
Server but integrated SQL Server management 
solutions that make use of the current suite of 
Microeodt system management applications. 


Dells hardware offerings range fom standalone 
servers to complete clustered bhigh-availability 
solutions, wath storage and backup hardware, 
DAS, MAS, and SAN storage; and a cam- 

plete service organizatinn that offers planning, 
migration, installation, traning, and ongoing 
заррогї services. 


The Dell SQL Server Solutions Web site at 
Вери لاس مهم‎ com/ content! topics global 
ampx заве јез ‘solutions'sottwares db/rnicrosoft | 
ql 2006? cH sie 55581=етп == offers 
دوه‎ сапе information an SCIL Server best 
practices, deployment, and business intelligence. 
You also can find planning and deployment 
tools, such as the SQL Server Advisor Tool, at 
htp: ^ erwdell.cam/content/topacs/ global 
aspx /bocds/adiisom/sgl وم اند‎ 
553&l—en&s-buz. These resources can 1 
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MATTERS! 


GET ACTIVE ONLINE AT SQLMAG.COM! 


Join the action on the SQL Server Magazine forums, at sqlforums.windowsitpro 


SOLMAG.COM 


Community Dialog 


Check out 


on a different server is a guarantee for the creation of a 
DBAs nightmare,” countered SQLUSA. And Pro Pete 
responded, “I’m not sure really why there is all this fuss 
regarding the restore/recovery process. We are all agreed 
that whatever recovery/restore approach is used, the key to 
a viable strategy is to ensure that you have appropriately 
up-to-date backups of everything...” Discussions like this 
one show that there’s no shortage of good ideas and strong 
opinions in the SQL Server Magazine community! 


Feedback Wanted 


One of the favorite forums of SQL Server Magazine 
editors is the Feedback forum (sqlforums.windowsitpro 
.com/web/forum/categories.aspx?catid=79&entercat=y). 
Our success depends on how well the magazine and Web 
site bring you the information that’s relevant to your jobs 
as SQL Server professionals. So we want to hear from 
subscribers and online members about what SQL Server 


.com. This month's featured post is dbbyleo's "Backup/Recovery Philosophy... pull 

up a chair,” at sglforums.windowsitpro.com/web/forum/messageview.aspx?catid-7 4& 
threadid=87792&enterthread=y. Be sure to log on, or if you're not already a registered 
user, click the green Register button at the top right of the page to sign up. 
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Magazine and Sqlmag.com are doing well—or need to 
improve. We're all about access, so stop by the Feedback 
forum and tell us how we can help deliver the information 
you need. 

Another way to give feedback to SOL Server Magazine 
editors and authors is by commenting on articles. Editors 
and authors subscribe to comment RSS feeds, so we can get 
timely feedback from readers and respond quickly. To com- 
ment on an article, you need to be a registered user. Simply 
scroll down to the bottom of the article, rate the article, 
and post your comments. And of course, you can always 
email any of the SOL Server Magazine authors or staff. For 
more information, see the Contact Us page at www.sqlmag 


.com/aboutus/index.cfm?action=contactus. SOL 


InstantDoc ID 97469 
—Anne Grubb, Web Lead Editor, 
SQL Server Magazine 


Want to talk SQL Server? 
the Sqlmag.com forums 


aybe it's the change in seasons, or building excite- 

ment before SQL Server Magazine Connections 
and the other DevConnections conferences—but some- 
thing's gotten into SQL Server forum members to make 
them even livelier than usual. At the time of this writing, 
the SQL Server General Discussion forum (sglforums 
.windowsitpro.com/web/forum/categories.aspx?catid= 
74&entercat=y), the second most popular forum behind 
SQL Server 2005 Administration, has had 19 posts with 
69 replies in the past week—and that’s just one of 30 
forums. Here are a couple of forum areas you'll want to 
take a look at. 


Database Backup and Recovery 
Worldviews 

If you're looking for a thought-provoking online con- 
versation, head over to the ongoing discussion about 
database backup, restore, and disaster recovery in the SOL 
Server General Discussion forum at sglforums.windows 
itpro.com/web/forum/messageview.aspx?catid=74& 
threadid=87792&enterthread=y. There, you'll find forum 
member dbbyleos well-thought-out explanation of his 
approach to database backup and recovery. 

Here's an excerpt from the post: “First, I approach 
recovery from the instance level, not the individual data- 
bases. While a SQL Server instance is made up of system- 
and user-databases, I make no distinctions between them 
in terms of recovery—the instance is the whole ‘database.’ 
Second, I approach it so that it will ultimately allow me 
to do a ‘displace’ recovery... not just an ‘in-place’ restore. 
In other words, be able to restore the database on the 
current host (‘in-place’), such as in a case of a media or 
other failure where the host is intact—which is usually 
the common, easy case. But above that, be able to recover 
the database onto another host, such as in a case of a 
catastrophic failure of the original host.... In the world 
of SQL Server, there seems to be a disconnect between 
how user databases should be backed up (and restored) 
from the way system databases should be backed up (and 
restored). Most people make distinctions between these 
databases when it comes to backup and recovery." 

As you might expect, Leo's post generated some 
energetic debate, like these sample comments. “The 
trouble with your thinking is that system databases have 
some server-dependent information. So restoring them 
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Meg Neeson 


redgate’ 


www.red-gate.com/backup ingeniously simple tools for SQL Server 


Eli Leiba 


ORE on the WEB 


Download the code at 
InstantDoc IDs 97389, 
97392, and 97393. 


Qus Tip 


Share Your 
Experiences 

Share your SQL Server code, 
comments, discoveries, and 
solutions to problems. Email 
your contributions to r2r (à) 
sqlmag.com. Please include 


your full name and phone 


number. We edit submis- 
sions for style, grammar, 
and length. If we print your 
submission, you'll get $100. 
——Кагеп Bemowski, 
senior editor 
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T-SQL Stored Procedure Creates Trace Tables 


uses the ::fn trace gettable system function. This function 
transforms a trace file into a table. The LoadTraceFile 
ToTable stored procedure also adds a bigint identity 
column to sequence the trace rows, as callout A shows. 

I wrote the LoadTraceFileToTable stored procedure for 
use on SQL Server 2005. To use Load TraceFileToTable, 
you need to pass in two parameters. The first parameter is 
the pathname to the target trace file. The second param- 
eter is the name of the table in which you want to put the 
trace file's data. The trace table's name needs to be in the 
format database.schema.tablename. For example, 
if the trace files pathname is C:\ELI_TRACE.tre 
and the trace table’s name is Adventure Works.dbo 
.tre_04052007, you'd run the command 


EXEC LoadTraceFileToTable 
gtraceFileName = 'C:\ELI_TRACE.trc', 
gnewTraceTableName = 

"AdventureWorks.dbo.trc_04052007' 


Before you run this command, though, you need to 
make sure that the following conditions are met. (The 
stored procedure doesn’t check for these conditions.) 

1. The trace file must exist prior to the stored procedure’s 
execution. 

2. The trace file parameter must be the full pathname. 

3. The table name parameter must be in the format data- 
base.schema.table. 

4. The trace table shouldn't exist prior to the stored 
procedure's execution. 

5. The SELECT INTO statement must be a valid state- 
ment in the server (1.е., the SELECT INTO/Bulk Copy 
option must be enabled). 


— Eli Leiba, Senior Application DBA, 
Israel Electric Company 
InstantDoc ID 97389 


from Trace Files 


"ve always preferred loading the data from a trace (.trc) 

file into a table for analysis. For me, it's easier to sort 
and handle trace data when it's in a table because I can 
easily run queries. So, I wrote the LoadTraceFileToTable 
stored procedure. With this stored procedure, I can trace 
whatever I want, put the results in a trace file, use Load- 
TraceFileToTable to load the data from the trace file into 
a trace table, run queries against the table, and dispose of 
the table without losing the original trace file. 

The LoadTraceFileToTable stored procedure isn't 
meant to replace but rather complement SQL Server Pro- 
filer, which also lets you create trace tables from trace files. 
However, Load TraceFileToTable does have one advantage 
over Profiler. With Profiler, the trace table can reach mil- 
lions of rows during the profiling process and hence enlarge 
the master or user database. With LoadTraceFileToTable, 
the table is created after and not during the profiling pro- 
cess, so the database isn’t needlessly enlarged. 

The LoadTraceFileToTable stored procedure, which 
Listing 1 shows, constructs a dynamic T-SQL statement. 
Specifically, it constructs a SELECT INTO statement that 


LISTING |: The LoadTraceFileToTable 
Stored Procedure 


CREATE PROCEDURE LoadTraceFileToTable 
(atraceFileName varchar(200), 
8newTraceTableName sysname) 

AS 

BEGIN 
DECLARE @tsqlStmt varchar(400) 

SET NOCOUNT ON 
SET atsqlStmt = 


(А) "SELECT IDENTITY(bigInt,1,1) AS Row Number,' + 
'* ۲ + ۲ INTO ۲ + RTRIM(OTTRIM(GnewTraceTableName)) + 
۲ FROM ::fn trace gettable C ' + ۲۲۲۲ + 
@traceFileName + '''' + ',default)' 


PRINT atsqlStmt 
EXEC (atsqlStmt) 
SET NOCOUNT OFF 
END 
GO 


Strim Strips Out Non-Alphanumeric Characters 


beyond ANSI.” Specifically, a variable assignment in a 
query such as 


SELECT да = да + col 
FROM Table ORDER BY col2 


will be performed in the expected order. For more infor- 
mation about these guarantees, see the “Ordering Guar- 
antees in SQL Server 2005” entry in the Tips, Tricks, and 


© Ё Server offers a variety of string functions, 
but it doesn’t have a function that strips 
out non-alphanumeric characters, such as a hyphen (-), 
an exclamation point (!), a percent sign (%), or a semi- 
colon (;). I created a function named Strim, which is 
short for strip trim, to fill this gap. 
The Strim function takes advantage of the fact that 
SQL Server “provides additional ordering guarantees 
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ы ئ ي‎ RR шо F2 к= 
њо с о = 


Finally, the string is reconstituted as abcdefg, assigned to 
@textout, and returned. 

For comparison, let’s look at what happens if a string 
contains non-alphanumeric characters. If 


atextin="pubs-dev" 


table D’s rows would be: 


i letter 
1 p 
2 u 
3 b 
4 s 
5 d 
6 e 
1 v 


and the string would be reconstituted as pubsdev. 

Ive used the Strim function to remove nonvisible 
characters in imported data (e.g., IBM EBCDIC data). 
Another use involves creating filenames for our database 
backup files. Our company uses Ideras SQL Compli- 
ance Manager, which creates files with filenames such 
as SQLcompliance_MyDatabase SQL.DOMAIN.COM. 
Our custom backup script uses the Strim function to strip 
out the unwanted characters in the database names when 
it creates the filenames for the backup files so that the 
filenames follow our standard format of myservername_ 
myinstancename SQLcomplianceMyDatabaseSQLDO- 
MAINCOM backuptype yyyymmddhhmmss.bak. The 
Strim function works on SQL Server 2005, SOL Server 
2000, and SQL Server 7.0. 

—Louis Nguyen, DBA, Centex Homes 
InstantDoc ID 97392 


LISTING 2: The Strim Function 


CREATE FUNCTION dbo.strim (8textin VARCHAR(8000) ) 


RETURNS VARCHAR(8000) 
AS 
BEGIN 


DECLARE QtextOut VARCHAR(8000) 


SELECT atextOut='' 


SELECT atextOut=atextOuttletter 


FROM 
( 


SELECT TOP 8000 i, SUBSTRING(p,i,1) AS LETTER 
FROM Tally, (SELECT atextin AS P) A 

A WHERE (ASCIICSUBSTRING(p,i,1)) BETWEEN 48 AND 57) 
OR (ASCIICSUBSTRING(p,i,1)) BETWEEN 65 AND 90) 
OR (ASCII(SUBSTRING(p,i,1)) BETWEEN 97 AND 122) 


ORDER BY i 
ا ك‎ 


RETURN(atextout) 
ND 


LISTING 3: Code That Creates the Tally Table 


SELECT TOP 8000 IDENTITYCINT,1,1) AS i INTO Tally FROM 
master.dbo.sysobjects a, master.dbo.sysobjects b 


Advice from the SQL Server Query Optimization Team 
blog (blogs.msdn.com/queryoptteam/archive/2006/05/02/ 


588731.aspx). Note that the blog has since morphed into 


the Tips, Tricks, and Advice from the SQL Server Query Pro- 


cessing Team blog (blogs.msdn.com/sqlqueryprocessing/ 


default.aspx). 
Listing 2 shows the Strim function. This function takes 


one input parameter (@textin), which is limited to 8,000 
characters. Strim first converts the string into a derived 
table labeled A. The derived table is cross-joined with a 
table of numbers named Tally to parse the string. Listing 3 
shows the code that creates the Tally table. The WHERE 
clause at callout A in Listing 2 filters the intermediate 
output from the cross join. The clause uses ASCII codes to 
accept only numbers (ASCII codes 48 through 57), upper- 
case letters (ASCII codes 65 through 90), and lowercase 
letters (ASCII codes 97 through 122). The function then 
converts the intermediate output to a derived table labeled 
D. For example, if 


atextin="abcdefg" 


table D’s rows would be: 


Tracking for Your SQL Server Agent Jobs 


ties for each job. SQL Server Agent stores the information, 
but you need to know the tables in which it’s stored. 

Гуе written a procedure, cspJobMonitor, that accesses 
the internal tables in SQL Server and displays the sum- 
mary results set. This procedure can provide answers to 
all the aforementioned questions and more. It works in 
three modes: LastRun, Last24, and NamedJob. 

LastRun mode. When you use the cspJobMonitor 
procedure in the LastRun mode, you receive a list of 
the last run for all jobs as well as additional information 
about each of those jobs. Figure 1, page 15, shows sample 
output from this mode. As you can see, the additional job 
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ow often are you asked: 

H e When was the last time this job was run? 

e Can I have a list of every time this job was run? 

* [s my job enabled? 

* What jobs will be affected if we bring down the system 
for maintenance? 

* Did our scheduled maintenance job run successfully 
last night? 


As you probably know, answering these questions is 
more complicated than you might think. You either have 
to view the history for all the jobs or review the job proper- 
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Business Objects 


YOU'VE NEVER SEEN A REPORT DO THIS BEFORE. 


CRYSTAL REPORTS" 2008 
DEFY THE LAWS OF REPORTING. 


Add new levels of decision support, stunning visualization 
and rich interactivity to your applications. Discover the 
latest designer productivity features, an improved report 
viewing experience and a free runtime for unlimited 
internal report engine deployment. 


Add Crystal Reports to your development tool kit and... 

۰ Enable What-If analysis with Xcelsius components, 
right on your reports (as shown). 

· Guide report exploration with on-report sorting, 
filtering and reformatting without re-hitting your 
database. 

۰ Embed Flash files for stunning visualizations and 
powerful decision support. 


Explore the new laws of reporting from Crystal Reports. 
Visit businessobjects.com/CR2008/dev or contact us at 1-888-333-6007. 


Copyright € 2007 Business Objects S.A. All rights reserved. 


Get your developers what 
they really want for Christmas! 


ApexSQL Developer Studio 

a stocking full of value added features that 
will put a smile on the faces of your SQL 
developers on Christmas morning. 


д - № 
Included in ApexSQL Developer Studio: i 
: ۳ : "Great tools! | will look forward to working x | 
Active data auditing and reporting with your company in the future as you add 
| more tools. Thanks for the effort to produce А 
QL Clear Risk-free delete and dependency analysis = quality applications." v 
L Template-based code generation "Ме really appreciate the quality of your Kim Rossey 
product, as well as your willingness to Developer d 
IL Diff Database comparison and synchronization accommodate our needs. You are tops on uw ud 3 
our list when we go searching for 
Database Documentation into HTML and CHM development tools." 
Scott C. Dixon, MCP, MCP+SB, MCSD "... ApexSQL tools have become indispensable 
Full Featured Editor/IDE for SQL Server Technical Consultant to us. They have worked flawlessly from day 
CTGi one and have saved us a huge amount of time 
Database scripting, packaging and deployment and effort..." 


Steve Bernat 
President 
Izonim, Inc. 


Customizable, rule-based Standards Enforcement 


Diff SSIS Package Comparison and Analysis 


For more information SOLE 
or to download a free trial version, go to: Isi EDITORS" CHOICE A 


www.apexsql.com مك‎ L^ Apexs QL 


or phone 866-665-5500 v 


2007-05-08 0 
2007-05-09 0 
2007-05-09 0 
2007-05-08 0 


Next Run 

2007-05-08 22:00:00.000 
2007-05-09 00:00:00.000 
2007-05-08 19:40:00.000 
2007-05-08 19:40:00.000 
2007-05-08 19:40:00.000 
2007-05-08 0.00 


Next Run 

2007-05-08 22:00:00.000 
2007-05-08 22:00:00.000 
2007-05-08 22:00:00.000 
2007-05-08 22:00:00.000 
2007-05-08 22:00:00.000 
2007-05-08 22:00:00.000 
2007-05-08 22:00:00.000 
2007-05-08 22:00:00.000 
2007-05-08 22:00:00.000 
2007-05-08 22:00:00.000 
2007-05-08 22:00:00.000 
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Duration Next Run 


SAS 70 auditors like to see a full year's worth of historical 
data. 

° The output isn't a replacement for SQL Server Agent's 
job history log but rather a quick summary of jobs in the 
system in a readable format. 


To use the procedure in the NamedJob mode, you run 
the command 


cspJobMonitor 'job name' 


where job пате is the name of the job for which you want 
to receive a history. 

You can download the cspJobMonitor procedure from 
SQL Server Magazine's Web site. (Go to www.sglmag 
.com, enter 97393 in the InstantDoc ID text box, then 
click the 97393.zip hotlink.) I wrote the procedure for use 


Sample output from the LastRun mode 


Sample output from the Last24 mode 


on SQL Server 2005. SQL 
—Eric Peterson, President, 
Peterson American Consulting 
InstantDoc ID 3 
Last Run Step Status 
2007-05-07 22:00:00.000 0 Success 00:00:41 
2007-05-08 00:00:00.000 0 Success 00:00:01 
2007-05-02 02:00:00.000 0 Success 04:54:27 
2007-05-08 19:43:00.000 0 Success 00:00:00 
Job Run Step Status Duration 
2007-05-07 22:00:00.000 0 Success 00:00:41 
2007-05-08 00:00:00.000 0 Success 00:00:01 
2007-05-08 18:43:00.000 0 Success 00:00:00 
2007-05-08 12:43:00.000 0 Success 00:00:00 
2007-05-08 06:43:00.000 0 Success 00:00:00 
2007-05-08 00:43:00.000 0 Success 00:00:00 
Job Run Step Status Duration 
2007-05-07 22:00:00.000 0 Success 00:00:41 
2007-05-06 22:00:00.000 0 Success 4 
2007-05-05 22:00:00.000 0 Success 00:01:32 
2007-05-04 22:00:00.000 0 Success 00:01:56 
2007-05-03 22:00:00.000 0 Failed 00:00:05 
2007-05-02 22:00:00.000 0 Success 00:01:56 
2007-05-01 22:00:00.000 0 Success 00:01:57 
2007-04-30 22:00:00.000 0 Success 00:00:43 
2007-04-29 22:00:00.000 0 Success 00:00:40 
2007-04-28 22:00:00.000 0 Success 00:00:40 
2007-04-27 22:00:00.000 0 Success 00:01:57 


Sample output from the NamedJob mode 


includes whether the job is enabled, how long the last 
run took, and when the next run is scheduled. To use the 
procedure in the LastRun mode, you run the command 


cspJobMonitor 'LastRun' 


Last24 mode. When you use the cspJobMonitor 
procedure in the Last24 mode, you receive a list of all 
the jobs that ran in the past 24 hours. Figure 2 shows 
sample output from the Last24 mode. Like in the LastRun 
output, the Last24 output includes additional information 
about each job. To use the procedure in the Last24 mode, 
you run the command 


cspJobMonitor 'Last24' 


NamedJob mode. When you use the cspJobMonitor pro- 
cedure in the NamedJob mode, you receive a list of all the 
runs that have occurred for the specified job. Figure 3 shows 
sample output from this mode. This historical information 
will be helpful to not only your boss but also Statement on 
Auditing Standards No. 70 (SAS 70) auditors. Before you 
use the NamedJob mode, though, you need to be aware 
of a few things about the 
output: 

* The "Enabled" and 
"Next Run" fields always 


Enabled 
Enabled 
Enabled 


Job Name 
Backup DBS 
Backup Sys DB 


show the results for the last | CentralizeFileS Enabled 
run. For example, in the | SQL Perfmon Enabled 
sample output in Figure 

3, the "Next Run" field for Figure | 


the Backup DBS job that 
ran on 2007-05-06 lists the 


next run as 2007-05-08 and | Job Name Enabled 
not 2007-05-07. Backup DBS Enabled 
* The sample output Backup Sys DB Enabled 
in Figure 3 lists only the | SQL Perfmon Enabled 
total job (i.e., step 0) for the SQL Perfmon Enabled 
status. If you code a job to | SQL Perfmon Enabled 
end with success or go to | SQL Perfmon Enabled 
the next step when an error Figure 2 


occurs, it will only show 
that the job completed and 
not that a single step failed. 


* The number of jobs Job Name Enabled 
| Backup DBS Enabled 
shown is controlled by 
the ейт da the histor Backup DBS Enabled 
e Setting m fe story | Backup DBS Enabled 
options in SQL Server 
Agent's Properties dial Backup DBS Enabled 
gents ۱۱۵۳۵۵ aog | Backup DBS Enabled 
box. By default, it's set at 
; ў Backup DBS Enabled 
1,000 rows, the history will 
۱ Backup DBS Enabled 
be automatically deleted 
from the system when the و‎ аш эшк 
total 0 or job-limit PD. pud 
и Backup DBS Enabled 
specification is reached. Backup DBS Enabled 
If you have several jobs, Ё 
reaching 1,000 rows doesn't 3 


take that long. Be careful— 
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Sivas ass for Microsoft® Technologies 


EYE ON BI 


cwebb @ solidg.com) is a mentor specializing 


Chris Webb 


in business intelligence at Solid Quality Men- 


tors and a SQL Server MVP. He is coauthor 
of MDX Solutions: With Microsoft SQL Server 


Analysis Services 2005 and Hyperion Essbase, 


2nd edition (Wiley). 


Getting MDX to Work in 
Reporting Services 


The workaround isn’t elegant and shouldn’t have 


2006 and 2007, and the calculated measure % Sales 
Growth for 2007 alone. 

In SSRS, you would have to write a query that had 
Sales and % Sales Growth on Columns and the years 
2006 and 2007 (cross joined with anything else you 
wanted to display in the report) on rows. This means that 
youre returning values in your query that you don’t actu- 
ally want (in this case, the values for % Sales Growth for 
the year 2006), which might have a negative impact on 
query performance if the unwanted values are returned 
by calculated members. Again, there is a workaround 
(see Web Listings 2 and 3) but performance of the 
rewritten query can be worse than that of the original. 

The problems I've described occur only when you 
use the built-in *Microsoft SQL Server Analysis Ser- 
vices" data source type. You can alternatively connect 
to SSAS using an OLE DB data source, but doing so 
means that you can't use the MDX query builder or use 
MDX parameters in your queries. The latter is a big loss 
because, to be able to parameterize a query, you have 


SSRS restricts the structure 
of the MDX queries you're 
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allowed to ۰ 


to dynamically generate your query string as an SSRS 
expression, and long expressions (it’s rare that the query 
string is short) can quickly become unreadable and 
unmaintainable. 

I know of many projects where people started out 
using SSRS as a reporting tool and then were forced to 
find an alternative that worked better with SSAS. It’s 
not clear whether the situation will be any better with 
SSRS 2008, but the initial signs are that nothing much 
will change. 

All that’s needed is for Reporting Services and Anal- 
ysis Services to be integrated properly so that a person 
can write whatever MDX query he or she likes and bind 
it to a control that will display the results. Is that too 
much to ask? SQL] 

InstantDoc ID 97290 


to exist in the first place 


ou could be forgiven for thinking that SQL 

Server Reporting Services (SSRS) and SQL Server 
Analysis Services (SSAS), two pillars of Microsoft’s BI 
toolset, would work well together. After all, if you’ve 
gone to all the trouble of building a cube, you would 
expect to be able to use it as the source for your SSRS 
reports and not just for ad-hoc querying, wouldn’t you? 
Unfortunately, anyone who has tried to create anything 
more than the most basic SSRS reports on top of SSAS 
will tell you that it can be an extremely frustrating 
experience. 

The most irritating issue (which—to make matters 
worse—is the result of a conscious design decision 
rather than a bug), is that SSRS restricts the structure of 
the MDX queries you're allowed to write: You can put 
the Measures dimension only on the Columns axis in a 
SELECT statement, and you must put all other dimen- 
sions either on the Rows axis or in the Where clause. 

True, you can subsequently use a matrix control to 
pivot your data within a report, but this is a poor substi- 
tute for being able to write the query the way you wanted 
in the first place. To use a SQL Server analogy, it’s as 
if SSRS prevented you from using a Group By clause 
in your queries and made you do any aggregation with 
built-in SSRS functionality. 

One example of the kind of problem this restric- 
tion causes occurs when you want to parameterize a 
report by the Measures dimension. When you create a 
parameter in a report, you usually want to base the list 


of available values 
ORE on the WEB 


for that parameter 
Download the listings at on the contents 
InstantDoc 10 97920. of a dataset. That 
dataset should be 
on the result of a query that returns these available 
values, one per row—but SSRS doesn’t let you write 
an MDX query with members of the Measures dimen- 
sions on the Rows axis. There is a workaround (see Web 
Listing 1—www.sglmag.com, InstantDoc ID 97920) but 
it isn't elegant and shouldn't really be necessary at all. 
Another problem occurs when you want to display 
specific combinations of members from the Measures 
dimension and members from other dimensions on the 
Columns axis in a report. For instance, you might want 
to show three columns: the measure Sales for the years 
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here is easy: You'll definitely want to jump on the 
virtualization bandwagon. Once confined to use as a 
tool for test environments or server consolidation, vir- 
tualization is emerging as a significant tool for every 
IT infrastructure. We'll discuss how virtualization is 
influencing SQL Server implementations everywhere. 

On the BI front, it’s abundantly clear that large en- 
terprises are increasingly turning to database-driven 
BI solutions to help them make better decisions about 
their businesses. The BI segment has been white-hot 
lately, with corporate acquisitions (Microsoft buy- 
ing Dundas Data Visualization) and significant new 
product releases (such as Microsoft Office Perform- 
ancePoint Server) driving even more interest and 
competition into the area. If you're tasked with de- 
ploying a BI solution in your organization, what tools 
and trends should you look for? 

As you head into the new year, you need to be pre- 
pared for the trends and tools that will determine how 
you do your job and make your company successful. 
Let's see what the future holds. 


Development 

Demand will continue to be high in the coming year 
for developers who write business applications with 
SQL Server back ends. In fact, the need for quick and 
efficient coding is so great that we're seeing the emer- 
gence of a new breed of development tools. These 
tools include products that make it easy for develop- 
ers and DBAs to quickly prototype, test, and deploy 
applications built on top of SQL Server. 

Products such as DataKrafts DataKraft Studio 
and Biro M&T's NConstruct let you create basic data- 
base-driven applications quickly without coding. For 
heterogeneous environments, you'll see cross-platform 
development tools such as Embarcadero Technolo- 
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ike a modern-day Nostradamus, IT managers and 
SQL Server administrators need to keep an eye on the 
future to make sure that their current technology and 
infrastructure investments stay relevant in the months 
and years to come. The next year promises to be event- 
ful for the SQL Server industry, so we thought that a 
bit of soothsaying and prognostication—teamed with 
sage advice from some experts—would help you navi- 
gate a sensible route through the latest technology de- 
velopments and noteworthy product releases. 

Because the upcoming release of SQL Server 2008 
will bring new developments from Microsoft that will 
cause ripples throughout the market, we also wanted 
to let you know which third-party products have the 
tools you need to stay on top of next year's challenges. 
To help you plot a safe course over the next year, we 
spoke with industry experts in three SQL Server ar- 
eas: development, administration, and business intel- 
ligence (BI). 

In the development space, new technologies and 
Microsoft’s product releases promise to make evolu- 
tionary changes in how databases are developed. The 
arrival of Language-Integrated Query (LINQ) will 
change the way database administrators and develop- 
ers work together, and the updated ADO.NET frame- 
work with Visual Studio 2008 will drive innovation by 
streamlining application development. Accompany- 
ing Microsoft's developer innovations, we're seeing 
a lot of rapid application development (RAD) tools 
coming to market, which could give overworked de- 
velopers good alternatives to traditional development 
methods. 

When it comes to database administration, how 
will the rapidly decreasing cost (and quickly increas- 
ing power) of computer hardware and disk storage 
affect your SQL Server environment? One prediction 
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and it is only a 
matter of time 
before more 
customers are 
doing the same" 


—Bogomil Balkansky, 
VMware senior director of 
product marketing 
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A look into the future 


than trying to understand the raw tables and column 
definitions a DBA may be more comfortable with. 
This keeps developers business focused.” 

Gugick also points to the introduction of LINQ 
with SQL Server 2008 as another boon for developers. 
“LINQ allows developers to query SQL Server using 
managed code in their language of choice—such as 
C or Visual Basic—rather than relying on embed- 
ded SQL statements. The end result is that developers 
will be creating SQL Server applications more quickly 
than they ever could.” 

Harried DBA developers are also increasingly 
facing issues related to time (or lack thereof), a trend 
that Gugick sees as a driving force behind their deci- 
sion to look at third-party tools to ease their work- 
load. “If you asked most database professionals what 
they want out of their third-party toolsets, I’m cer- 
tain you would hear a clear desire to have those tools 
help them get a handle on their environment, reduce 
their workload, and increase their productivity,” says 
Gugick. “In other words, saving them time.” 

To help you meet your future development de- 
mands, we’ve put together a list of some of the key 
products and tools in the XML, ASP.NET, and data- 
base utility categories. 


Administration 

New releases from Microsoft again set the stage for 
competition to heat up in 2008. First, the release of 
Microsoft System Center Data Protection Manager 
(DPM) 2007, with its support for SQL Server, will 
stir up third-party backup and continuous data pro- 
tection (CDP) solutions to differentiate themselves 
and add value. Everyone needs good backup and 
recovery solutions, and this will be the year to find 
the best fit for your situation as vendors vie for your 
business. 

Virtualization is already having a major impact 
on the IT industry, and recent trends indicate that 
virtualization will have a more significant effect on 
how databases are secured, backed up, and managed 
in the future. “Were still at the beginning in terms of 
virtualizing database applications," says VM ware se- 
nior director of product marketing Bogomil Balkan- 
sky. "About 60 percent of our customers are telling 
us that they run databases in virtual machines, and 
it is only a matter of time before more customers are 
doing the same." Balkansky explains that virtualiza- 
tion is still in its infancy, and that remaining con- 


gies' Rapid SQL. SQL Server 2008 and Visual Studio 
2008 promise to introduce a number of features that 
should benefit database developers, beginning with 
better support for the ADO.NET entity framework 
and LINQ—not to forget Visual Studio Team Edi- 
tion for Database Professionals. 

“The new ADO.NET Entity Framework enables 
more efficient software creation by giving developers 
the ability to work with logical data entities, or objects, 
that are mapped to the underlying tables and columns 
in the database,” says David Gugick, director of prod- 
uct management for SQL Server solutions at Quest 
Software. “Ву abstracting the underlying complexity 
of a normalized database, developers can work with 
meaningful objects that represent the business, rather 
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XML 

e Stylus Studio 2007 XML Enterprise Suite, 
DataDirect Technologies, www.stylusstudio.com 

* Visual Studio 2008, Microsoft, 
www.microsoft.com 


* XMLSpy 2008, Altova, www.altova.com 
ASP.NET 


* Dundas Gauge for .NET, Dundas Data Visualization, 
www.dundas.com 


* FileUp Enterprise Edition, SoftArtisans, 
www.softartisans.com 
* NConstruct, Biro M&T, www.nconstruct.com 


* ScaleOut StateServer, ScaleOut Software, 
www.scaleoutsoftware.com 


e Strangeloop AppScaler, Strangeloop Networks, 
www.strangeloopnetworks.com 


Database 

* ApexSQL Developer Studio, ApexSQL, 
www.apexsql.com 

* DatabaseSpy 2008, Altova, www.altova.com 

* DataDirect Connect for ADO.NET, DataDirect 
Technologies, www.datadirect.com 

* DataKraft Studio, DataKraft, www.datakraft.net 

* NConstruct, Biro M&T, www.nconstruct.com 

* Rapid SQL, Embarcadero Technologies, 
www.embarcadero.com 

• SQL Farm Combine, SQL Farms, 
www.sqlfarms.com 

* Toad Data Modeler, Quest Software, 
www.quest.com 
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Backup and Recovery 
* Double-Take for Windows, Double-Take Software, 
www.doubletake.com 


* HyperBac for SQL Server, Xceleon Technologies, 
www.hyperbac.com 


* LiteSpeed, Quest Software, www.quest.com 


* Microsoft System Center Data Protection Manager 
(DPM), Microsoft, www.microsoft.com 


* SQL Backup, Red Gate Software, www.red-gate.com 
* SQLsafe, Idera, www.idera.com 

* Time Navigator, Atempo, www.atempo.com 

* UltraBac, UltraBac Software, www.ultrabac.com 


Virtualization 

• ESX Server 3.5, VMware, www.vmware.com 

* HP PolyServe Database Utility for SQL Server, HP, 
www.hp.com 

* Virtual Iron 4.0, Virtual Iron Software, 
www.virtualiron.com 

* Windows Server 2008 (Windows Server virtualiza- 
tion—WSv), Microsoft, www.microsoft.com 

Hardware 

• HP ProLiant DL585, HP, www.hp.com 

* IBM System Storage N5000, IBM, www.ibm.com 

e SecureSphere, Imperva, www.imperva.com 


Check out what SQL Server MVPs call 
"Revolutionary Product..." 
"Fantastic Tool..." 


SQL SERVER TRENDS 


cerns about running databases in virtual machines 
(VMs)—such as I/O performance and management 
complexity—will continue to be addressed over time. 
Balkansky points to several areas in which virtual- 
ization is already indirectly affecting database envi- 
ronments, ranging from server consolidation in the 
data center to an increase in the availability of other 
network resources, such as storage and backup. VM- 
ware is the established leader in virtualization, but 
you can expect a flurry of new virtualization prod- 
ucts and novel solutions. 

Microsoft is also jumping into the virtualization 
space in a big way with the upcoming release of Win- 
dows Server virtualization (WSv), the hypervisor 
solution designed as part of Windows Server 2008. 
WSv requires virtualization support in the processor, 
a feature that both the Intel VT and AMD-V proces- 
sors provide. Large database applications also require 
beefy hardware. So you'll see Intel’s and AMD’s quad- 
core servers playing a leading role in all sorts of SQL 
Server solutions. 

With these trends in mind, you'll need to know 
what products are available and what products are on 
the horizon to meet your demands. We've created a 
list of some of the key players in backup and recovery, 
virtualization, and hardware. 


7 Editors Note 


Jeff and Megan would 

like to hear about your 
experiences with these 
products. Please post your 
thoughts 00/۱۱۶ ۵ 
_сот, InstantDoc 10 97289), 
or send them to jjames@ 
windowsitpro.com and 
mbearly @ windowsitpro 
com. 
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SQL Farm Combine " 


Automate rollouts, deployments, and daily admin tasks 


Move code between Dev-QA-Production: One-click 
code deployment on all databases and servers 


Deploy DB projects, execute multiple scripts 
and queries on all databases and servers 


Collect data & performance metrics from all servers 
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Dell and Microsoft have worked together for over 20 years, and this 
close partnership means we can offer a SQL Server solution that's quick 
to deploy, easy to manage, and headache-free. That's why we've sold 
more than anyone else, anywhere in the world. 
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2007. This product lets you monitor and analyze busi- 
ness performance as well as plan business activities. 
“We need to step back and look at Microsoft’s BI 
offering, which starts with SQL Server for the BI in- 
frastructure, Office 2007 (Excel, SharePoint) for end 
user delivery, and PerformancePoint Server for ana- 
lytical applications,” explains Ajenstat. “SQL Server 
is the foundation that enables our pervasive BI vision 
to become a reality and ultimately ensure that we add 
value to any decision at any levels of the company.” 


BUSINESS 
INTELLIGENCE 


Reporting Tools 
* Analyzer 2007, Strategy Companion, 


www.strategycompanion.com 
* BusinessObjects XI R2, Business Objects, 
www.businessobjects.com 


• DSP Performance Canvas 2007, DSPanel, 
www.dspanel.com 


e Dundas Chart, Dundas Data Visualization, 
www.dundas.com 


* Radius90, 90 Degree Software, 
www.90degreesoftware.com 


e RSinteract, ICS, www.rsinteract.com 


* SQL-RD, ChristianSteven Software, 
www.christiansteven.com 


* Tableau, Tableau Software, 
www.tableausoftware.com 


• XL Cubed, XLCubed, www.xlcubed.com 


Performance Management Tools 
* Microsoft Office PerformancePoint Server 2007, 
Microsoft, www.microsoft.com 


BI Servers 
* Dell PowerEdge Server preconfigured with 
SQL Server 2005, Dell, www.dell.com 


* SAS Enterprise BI Server, HP, www.hp.com 


New to the BI scene in 2007 were BI servers. HP 
announced that it would offer a server preconfigured 
with SQL Server 2005, which can be used to deploy BI 
solutions. Microsoft and Dell recently followed suit: 
Dell began producing a line of its PowerEdge servers 
preconfigured with SQL Server 2005, and will begin 
offering PowerEdge servers preconfigured with SQL 
Server 2008 following the SQL Server 2008 launch. 
Dell will also offer servers preconfigured with Perfor- 
mancePoint Server 2007 as an add-on. 

With all this activity coming from the big play- 
ers, we expect the BI market to really heat up in 2008. 
Whether you're thinking about BI solutions or are al- 
ready implementing them, you'll want to pay close at- 
tention to this market. 2008 will be the year when you'll 
be seeing lots of new BI products and initiatives. We've 
listed the products available in this arena today. EM 

InstantDoc ID 97289 
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Business Intelligence 

The coming year will be a defining year for BI prod- 
ucts because Microsoft is determined to make BI 
mainstream. According to Francois Ajenstat, Mi- 
crosoft director of product management for SQL 
Server, BI has become a priority for CIOs. “ВІ is now 
the number one CIO priority according to Gartner," 
says Ajenstat. “I believe that more users will want to 
benefit from the value of BI...proprietary and costly 
solutions are no longer required and won't enable per- 
vasive BI to become a reality." 

Microsoft's initiative is to provide an end-to-end 
solution that integrates BI from the DBA to the business 
user through Microsoft Office and Microsoft Office 
SharePoint Server 2007. Although SQL Server 2005 
includes powerful BI components, Microsoft has 
stepped up the BI features in SQL Server 2008. SQL 
Server 2008 includes a BI platform that offers data 
integration and management, enhanced reporting 
options that let you create reports in Microsoft Word 
or Excel, and analysis tools such as SQL Server Analy- 
sis Services (SSAS) data mining algorithms. 

Microsoft is out to conquer the business end of 
BI, starting with its new performance management 
release, Microsoft Office PerformancePoint Server 
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value of BI... 
proprietary and 
costly solutions 
are no longer 
required and won’t 
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Read-Gonsisteney 
PROBLEMS, PART | 


Get the lowdown on index internals, isolation 
levels, and locking—and their impact on read 


that follow. I believe they’re integral to understanding this 
article’s contents. 

Heap table. A heap table is an organization of a table 
with no clustered index. SQL Server maps heap data with 
one or more Index Allocation Map (IAM) pages that map 
the table’s data in file order. No logical ordering is enforced 
in a heap. When a new row is inserted into a heap, SQL 
Server uses a special type of page called Page Free Space 
(PFS) to locate free space in which to insert the new row. 
When you update a variable-length column in a row in a 
heap with a longer value such that the row doesn’t have 
room to expand in the original page, SQL Server will move 
the row to another page, leaving a forwarding pointer 
in the row’s original location, pointing to the row’s new 
address. It’s important to note that neither inserts nor 
updates can cause page splits (which ГЇЇ describe in a 
moment) in a heap. 

Clustered table. A clustered table is an organization of 
a table with a clustered index in a B-Tree data structure. 
For the purpose of this discussion, ГЇЇ focus on the leaf 
level of the index, which contains the full data rows. The 
pages in the leaf level are organized in a doubly linked 
list, maintaining logical index key ordering. Besides the 
linked list, SQL Server also maps the leaf level’s data 
with IAM pages in file order. The logical order of the 
pages maintained by the linked list (index key order) can 
be different from the order of the pages in the files. The 
discrepancy between index order of pages and file order of 
pages is called logical fragmentation. This fragmentation is 
measured as the percentage of out-of-order pages as you 
scan the pages in index order. An out-of-order page is one 
that appears in the file before the page that points to it in 
the linked list. 

Nonclustered index. Like a clustered index, a nonclus- 
tered index is organized as a B-Tree. The leaf level contains 
as many rows as the number of rows in the table, each with 
a copy of a subset of columns from the corresponding 
data row, plus a row locator that’s used to locate the full 
data row. The index leaf level organizes the data in a 


consistency 


concern things that we take for granted and never 
bother to check, just because they seem so trivial, 
or because we make wrong conclusions based on our 
observations and understanding of reality, or because 
we misinterpret other explanations. Try answering the 
following questions before you read on, and when you're 
done reading the article, answer them again: When you use 
the Read Uncommitted isolation (e.g., when specifying the 
NOLOCK hint) to query data, what would change in SOL 
Server's treatment of SELECT queries compared with the 
default Read Committed isolation? When you query data 
using the Read Uncommitted or Read Committed isola- 
tion, can you get the same row multiple times? When you 
query data using the Read Uncommitted or Read Com- 
mitted isolation, is it possible that your query will skip 
a row? When you examine a query's execution plan and 
see an Index Scan or Clustered Index Scan operator with 
Ordered: False, how will SQL Server scan the data? 
In this article, ГЇЇ try to shed some light on the way 
reads behave in different scenarios and the consistency 
level you should expect in each scenario. Most of this 


article's findings are 
ORE on «һе WEB 


fairly recent—in the 
See the Web figure at past year or so— 
InstantDoc Ip 97278. and my answers to 
the questions above 
before those findings were wrong or incomplete. I found 
myself surprised by the true answers to these questions (at 
least true to the best of my current understanding), and 
my current understanding led me to revisit my past choices 
and recommendations regarding reading data. I described 
some of these findings in my blog, but here ГЇЇ provide 
a more complete picture, as well as some new findings. 
But first, ГЇЇ set some groundwork with a few important 
fundamentals. 


۱ ife is full of incorrect assumptions, many of which 


Some Important Fundamentals 
Bear with me if you're already familiar with the concepts 
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Index-order scan. This type of scan can be used to scan 
index data in logical index key order—that is, scanning 
the leaf-level pages by following the linked list either from 
head to tail (forward scan) or from tail to head (backward 
scan). When this type of scan is used, the data is returned 
to the next operator in the plan (or to the caller, if that’s 
the last operator being processed) in logical index key 
order. The performance of index-order scans is affected by 
logical fragmentation since out-of-order pages require the 
disk arm to go back and forth. With zero fragmentation, 
the performance of an index-order scan should be similar 
to that of an allocation-order scan: The higher the frag- 
mentation, the worse the performance of an index-order 
scan compared with an allocation-order scan—up to 
several times slower. The only exception is when the table 
is very small (up to 64 pages); an allocation-order scan 
can be more expensive than an index-order scan because 
of some overhead involved with this type of scan. 


Consistency Problems 

with Read Uncommitted 

The use of the Read Uncommitted isolation level (e.g., 
with the NOLOCK hint) is a common practice in attempt 
to overcome blocking problems under the default Read 
Committed isolation. However, with Read Uncommitted, 
there are several changes in the way SQL Server handles 
reads that you might not be aware of. This month and 
next, ГЇЇ describe read-consistency problems under the 
Read Uncommitted isolation, resulting from inserts run- 
ning while processes are reading data. In a later article, 
ГЇЇ cover read-consistency problems under both Read 
Uncommitted and Read Committed isolations resulting 
from updates running while processes are reading data. 

A common perception programmers have regarding 
behavioral changes of reads when working with Read 
Uncommitted is that the only change is that SQL Server 
won't request shared locks when reading data. There- 
fore, readers can get “dirty data"—that is, uncommitted 
changes. But the reality is that there are other behavioral 
changes of readers besides not acquiring shared locks 
that can result in reading the same row multiple times or 
skipping rows that existed when the read started. 

These read-consistency problems have to do with changes 
in the storage engine choices when instructed by the relational 
engine (specifically by the optimizer) to perform index scans 
(both clustered and nonclustered) with Ordered: False. 

The optimizer will use an index scan whenever all the 
index leaf data is needed. When the data isn’t required to 
be scanned in order, the Index Scan operator would show 
Ordered: False in the tooltip box. When the data is required 
to be scanned in order (e.g., to satisfy an ORDER BY 
request, GROUP BY, DISTINCT, merge join), the Index 
Scan operator would show Ordered: True. As an example, 
examine the execution plans that Web Figure 1 (www.sqlmag 
com, InstantDoc ID 97278) shows for the following three 
queries: 
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doubly linked list, maintaining logical index key ordering. 
SQL Server also maps the nonclustered index leaf level's 
data with IAM pages in file order. The aforementioned 
fragmentation is also relevant to nonclustered indexes. 

Page splits. When a new row is inserted into an index 
(clustered or nonclustered), SQL Server identifies the target 
page based on the new row's index key order. If the target 
page has no room to accommodate the new row, SQL 
Server will split the page—that is, it will allocate a new page 
(before or after the current page in the file or in another 
file), move half the rows from the original page to the new 
one, insert the row to one of the two pages depending on 
the index key order of the new row, and update the pointers 
in the linked list to preserve logical index key ordering 
among the pages. The only exception to this rule is when 
the new row has a higher index key value than the highest 
existing key, in which case SQL Server won't move half 
the rows from the original page to the new one. A similar 
split process will take place if a row is expanded as a result 
of an update to a variable-length column when there's no 
room for the row to expand in the page. 

Allocation-order scan. This type of scan can be used to 
scan heap or index (clustered and nonclustered) data in file 
order based on IAM pages. With heap data, that's the only 
type of scan that can be used. With index data, that's one 
of the two possible alternatives. The performance of allo- 
cation-order scans isn't affected by logical fragmentation 
because this scan is done in file order, not in index order. 
However, the performance of allocation-order scans is 
affected by file system fragmentation of the database data 
files SQL Server Enterprise Edition supports Advanced 
Scanning (aka merry-go-round scans), which is applicable 
only to allocation-order scans (not to index-order scans, 
which ГЇЇ describe next). Advanced scanning allows a 
request for an allocation-order scan to piggyback an 
already-running allocation-order scan, and when both 
processes being served by the same scan are done, the one 
that joined later will scan the remaining data (before the 
point it joined the scan). 


THE LOGICAL PUZZLE 
NOVEMBER'S PUZZLE: CRAZY SEQUENCE 


We posted a lengthy puzzle involving a long number at our Web site (InstantDoc ID 
97007). Have you been able to determine the next number in that sequence? Let n 
be the zero-based position of the number in the sequence (0, 1, 2, 3, ...). The actual 
number in the sequence is n factorials of n—that is, О, 1!, 211, 3!!!, and so on. The 
lengthy last number in the sequence is 3!!!, so the next number in the sequence is 


it wouldn't leave space for anything else. 


DECEMBER'S PUZZLE: MINIMUM NUMBER OF WEIGHTS 

Can you determine the minimum number of weights required to measure any integer 
weight in the range 1 through 100 pounds using a scale? Also, can you generalize 
your answer for a range 1 through n pounds? 
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is not what 
we don’t 
know; it’s 
what we 
know for 
sure that 


just ain’t so. 
—Mark Twain 
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Similarly, if a page that wasn’t yet reached by the 
scan splits, and the new page is allocated before the scan 
position, the scan will end up skipping or not reading the 
rows that moved to the new page. For these potential con- 
sistency problems, in Read Committed isolation or higher, 
the storage engine would opt for an index-order scan when 
the optimizer requests an index scan with Ordered: False. 
There are exceptions, which ГЇЇ discuss later. 

The reason an index-order scan won't allow these 
problems is that the pages are read in logical index key 
order following the linked list. If a page splits after the 
scan visited it, both pages (the one that got split and the 
new one) won't be visited again because they're behind in 
logical order. If a page splits before the scan reached it, the 
scan will read both pages in order. 

An index-order scan won't be as efficient as an alloca- 
tion-order scan because fragmentation will negatively im- 
pact an index-order scan. However, these particular consis- 
tency problems won't happen. Other consistency problems 
might happen, but ГЇЇ discuss those in a later article. 

What might surprise you is that when an index scan 
with Ordered: False is running under Read Uncommitted, 
the storage engine assumes that you have no consistency 
expectations whatsoever but rather that you're aiming at 
the best possible performance. Therefore, the engine will 
use an allocation-order scan and these consistency prob- 
lems might happen if rows are inserted while the scan is 
running. There’s an exception in SQL Server 2005: If the 
index is small (64 pages or fewer), the storage engine will 
use an index-order scan because it’s more efficient than an 
allocation-order scan with so little data. Unfortunately, 
when you see an index scan in the plan with Ordered: 
False, there’s no way to tell in the plan whether the storage 
engine will, in practice, use an allocation-order scan or an 
index-order scan. 

It should be noted that table scans against a heap don’t 
incur the consistency problems that this article describes 
for the simple fact that heaps don’t incur page splits, as I 
explained earlier. However, keep in mind that clustered 
tables have some advantages over heaps—for example, 
they let you control the way inserts are distributed over 
disk drives. Also, even when the table is organized as a 
heap, if there are nonclustered indexes, those can incur 
splits, and scans of such indexes could incur the aforemen- 
tioned read-consistency problems. 


Ready for Examples? 
Next month, ГЇЇ provide tangible examples with code 
you can run to see the problems with your own eyes, and 
ГЇЇ suggest alternatives to avoid these problems. Several 
people are due thanks for their part in shedding light on 
the read-consistency problems I’ve covered in this article: 
Lubor Kollar, Srikumar Rangarajan, Tony Rogerson, 
Davide Mauri, Andrea Benedetti, Gianluca Hotz, Maciej 
Pilecki, and Paul Randal. SQL 
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USE AdventureWorks; 

-- Query 1 

SELECT * FROM Sales.SalesOrderHeader; 

-- Query 2 

SELECT SalesOrderNumber FROM Sales. 
SalesOrderHeader; 

-- Query 3 

SELECT * FROM Sales.SalesOrderHeader 
ORDER BY SalesOrderID; 


Observe that the first two index scans (1.е., first clustered, 
second nonclustered) show Ordered: False, whereas the 
third shows Ordered: True. With the first two scans, as far 
as the optimizer is concerned, the data can be returned in 
any order, but with the third scan, it needs to be returned 
in index key order. 

Ordered: False doesn't necessarily mean that the 
storage engine in charge of the actual row operations 
(fetching the data) will utilize the more efficient alloca- 
tion-order scans based on IAM pages. The storage engine 
needs to also take into account your consistency expecta- 
tions based on the isolation level the query runs under. 
An allocation-order scan running while inserts are issued 
against the table/index might result in getting the same row 
multiple times or skipping rows. 

As an example of getting the same row multiple times, 
suppose you have the following index keys in pages of 
an index shown in file order: Page#101(keys: 50, 60, 70, 
80), Page#107(keys: 10, 20, 30, 40), Page#121(keys: 90, 
100, 110, 120). The logical index order of the pages is 
Page#107(keys: 10, 20, 30, 40)>Page#101(keys: 50, 60, 
70, 80)>Page#121(keys: 90, 100, 110, 120). Suppose an 
allocation-order scan starts and reads pages 101 and 107. 
So far, it has read the rows with the keys 50, 60, 70, 80, 
10, 20, 30, 40. At this point (before the scan is finished), 
another process inserts a row with the key value 65. Sup- 
pose the target page 101 is full, so it undergoes a split. 

It's important to note that їп a Read Committed isola- 
tion, shared locks aren't kept until the statement 15 done but 
rather until SQL Server finishes reading the locked resource 
(row or page). So, even if SQL Server decides to lock pages 
when reading, once the page is read, the lock is released. 
The split process allocates a new page (can be either 
before or after the page that got split in the file)—say, page 
135—and moves half the rows from the source page to the 
new one, and inserts the new row to one of the two pages 
based on its key value. The layout of the data in the file is 
now Page#101(keys: 50, 60, 65), Page#107(keys: 10, 20, 30, 
40), Page##121(keys: 90, 100, 110, 120), Page#135(keys: 70, 
80). The linked list will reflect the right logical index key 
order, so the layout in the index is Page#107(keys: 10, 20, 
30, 40)2 Pagezt101(keys: 50, 60, 65) Page#135(keys: 70, 
80)>Page#121(keys: 90, 100, 110, 120). The scan continues 
reading the rest of the pages—121 and 135, reading the 
rows with the keys 90, 100, 110, 120, 70, and 80. You realize 
that the rows with the keys 70 and 80 were read twice. 
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Managing 


Plan-Cache 


Monitor and control cache size 
for optimum performance 


interface. Prepared queries let SQL Server store one plan 
for many queries that differ only in the actual values used 
for qualifying data in the query’s WHERE clause and 
greatly reduce the cache space needed. 

SQL Server 7.0 limited SQL Server’s total memory to 
2GB, which in effect limited SQL Server's plan-cache size. 
The 2GB limit exists because on 32-bit systems, a 32-bit 
address space can directly address only 4GB of memory, 
and 2GB is normally reserved for the О$% own uses, with 
the other 2GB available for applications, such as SQL 
Server. SQL Server 2000 added the ability to configure the 
use of Address Windowing Extensions (AWE) memory, 
which lets SQL Server use more than 4GB if the OS has 
more than 4GB of memory. However, since plan cache can’t 
use AWE memory, youre limited to only 2GB of memory 
for plan cache, or 3GB if you used the /3GB switch. (For 
more information about controlling the total amount of 
memory that SQL Server 2000 and 7.0 can use, 
see the Microsoft article “How to configure SQL 
Server to use more than 2 GB of physical memory” 
at support.microsoft.com/?kbid=274750.) A 64- 
bit system doesn’t have any of these limitations 
because it can directly address far more memory than SQL 
Server or Windows can support. 

In many cases, limiting memory to only the non-AWE 
memory meant that SQL Server 2000 didn’t have enough 
memory to store all the plans that should be in cache. SQL 
Server 2005 tried to improve this situation by allowing far 
more memory to be used for plans. However, it appears 
that Microsoft went too far and ended up allowing too 
much memory to be used for plans—and so in SQL Server 
2005 SP2, Microsoft changed the sizing algorithm again. 
Let's look at the SQL Server 2005 algorithms to limit the 
size of plan cache. 


Plan-Cache Controls in SQL 
Server 2005 

Although plan cache’s size isn’t directly limited in SQL 
Server 2005, SQL Server does determine a limit at which 


the ability to cache procedure plans and other 

types of plans in what’s known as plan cache. Plan 
caching lets SQL Server cache and reuse various types of 
plans to avoid recompilation—and its associated perfor- 
mance hit on SQL Server—wherever possible. (For some 
background about plan caching, see the sidebar “A Brief 
History of Plan Caching in SQL Server,” page 28.) SQL 
Server 2000 and 7.0 have an upper limit on plan-cache 
size. SQL Server 2005 doesn’t have a specific upper limit 
on plan-cache size but provides some controls to keep plan 
cache from getting overly large, so that it doesn’t use too 
much memory and thus negatively affect performance. 
Let’s take a closer look at these controls, to help you pay 
more attention to plan-cache size so that you can consider 
it in your SQL Server performance-tuning efforts. ГЇЇ base 
my discussion about plan-cache limits in part on informa- 
tion in Inside Microsoft SOL Server 2005: Query Tuning 
and Optimization (Microsoft Press, 2007). 


© ince version 7.0, SQL Server has long provided 


Memory and Plan-Cache Limits 

SQL Server 2005’s lack of an upper limit on plan-cache 
size could potentially be detrimental, especially since 
SQL Server can cache many small, ad hoc plans that 
might never be reused. Using a large amount of memory 
for many hundreds or thousands of ad hoc plans is 
called plan-cache bloat. In addition to using an епог- 
mous amount of memory, cache bloat can affect you in 
another way. Having hundreds or thousands of similar 
plans, SQL Server might spend an inordinate amount of 
CPU time simply trying to determine whether there’s a 
plan in cache for your current query. For these reasons, 
I strongly recommend that whenever possible you don’t 
base your applications on ad hoc queries. Creating stored 
procedures gives you the most control over plan reuse, 
but you might not be able to use stored procedures for 
everything. The second-best option is to use prepared 
queries, either through your application (using prepare 
and execute methods) or using T-SQL’s sp executesql 
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it considers the system to be under memory pressure. 
When it detects memory pressure, SQL Server will start 
removing plans from cache to free up memory. When 
discussing memory pressure, we refer to the term visible 
memory—that is, the directly addressable physical memory 
available to the SQL Server buffer pool. On a 32-bit SQL 
Server instance, the maximum value for visible memory is 
either 2GB or 3GB, depending on whether you have the 
/3GB flag set in your boot.ini file. Memory with addresses 
greater than 2GB or 3GB is available only indirectly, 
through AWE-mapped memory. On a 64-bit SQL Server 
instance, “visible” memory has no special meaning, as all 
the memory is directly addressable. If I refer to visible 
memory greater than 3GB, keep in mind that this is pos- 
sible only on a 64-bit SQL Server system. 

The term target memory means the maximum amount 
of memory that can be committed to the SQL Server 
process. Target memory refers to the physical memory 
committed to the buffer pool and is the lesser of the 
values you've configured for *max server memory" and 
the total amount of physical memory available to the 
OS. Thus, visible target memory is the visible portion of 
the target memory. Query plans can be stored only in the 
non-AWE-mapped memory, which is why the concept of 
visible memory is important. 

Table 1 shows how to determine the plan-cache pres- 
sure limit in SQL Server 2005 and SQL Server 2000 and 
indicates the change in SQL Server 2005 SP2, which 
reduced the pressure limit percentage when larger amounts 
of memory are available. Be aware that these formulas are 
subject to change again in future service packs. 

The details of exactly what SQL Server will remove 
from cache when cache pressure is reached are beyond 
the scope of this article. You can find more information 
about this topic, including details about different types of 
memory pressure, in Chapter 5 of Inside Microsoft SOL 
Server 2005: Query Tuning and Optimization. However, 
the bottom line is that pressure is signaled a lot sooner in 
SQL Server 2005 SP2 than in the initial release or in SP1. 
Let's consider an example. 

Assume you're on SQL Server 2005 SP1, on a 64-bit 
SQL Server instance with 28GB of target memory. Per the 
formula in Table 1, the plan-cache pressure limit would be 
75 percent of 8GB plus 50 percent of the target memory 
over 8GB (in this case, 50 percent of 20GB)—that is, (.75 
x 8GB) * (.5 x 20GB) < 6GB + 10GB < 16GB. On SQL 
Server 2005 SP2, on the 64-bit SQL Server instance with 
28GB of target memory, the plan-cache pressure limit 
would be 75 percent of 4GB plus 10 percent of the target 
memory over 4GB (or 10 percent of 24GB)—that is, (.75 
x 4GB) + (.10 x 24GB) = 3GB + 2.4GB = 5.4GB. 


Ways to Trim Down Plan Cache 

So what can you do when plan cache gets bloated? You 
can remove all your plans from cache by using the DBCC 
FREEPROCCACHE command, but often this method 
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B i MANAGING PLAN-CACHE SIZE 


A BRIEF HISTORY OF PLAN CACHING 
IN SQL SERVER 


In SQL Server's early days, the ability to reuse compiled plans for stored procedures 
was one of the product’s most heavily marketed features. As | described back in Sep- 
tember 1999 (see “Inside SQL Server: SQL Server 7.0 Plan Caching,” InstantDoc ID 
5915), although reusing procedure plans was (and still is) a good thing, it wasn’t the 
only—or even the most important—reason for using stored procedures. Starting with 
SQL Server 7.0, the cost of optimization became orders of magnitude more expensive 
compared with earlier versions, because Microsoft rewrote the optimizer and added 
many new execution techniques that the optimizer needed to consider. The increased 
cost of optimization made it more important than ever for SQL Server to avoid recom- 
piling queries whenever possible. As | further discussed in “Inside SQL Server: SQL 
Server 7.0 Plan Caching,” SQL Server 7.0 began caching and reusing other types of 
plans, in addition to stored procedure plans, to avoid recompilation. The other types of 
plans include plans for ad hoc queries and for autoparameterized or prepared queries. 


Another change introduced in SQL Server 7.0 was how the size of procedure 
cache is determined. (In general, we now call procedure cache plan cache because 
it caches more than just the plans for procedures.) Before SQL Server 7.0, a system 
administrator would configure a total amount of memory for SQL Server to use. Then 
within this total memory, the administrator would configure the memory to be used for 
certain specific purposes, such as user connections, open objects, locks, and devices. 
Whatever memory remained was used for cache and was split between data cache and 
procedure cache. The final configuration that the administrator did was to set a per- 
centage that specified how this remaining memory was to be split. The default of 30 
meant that 30 percent of this remaining memory would be used for procedure cache. 

Back before SQL Server 7.0, in many cases the default of 30 percent of the 
remaining memory for procedure plans turned out to be far too large. Since only 
procedure plans were cached, and the plans were much less complex than in current 
versions, SQL Server rarely needed this much space for plans. Keeping most of the 
procedure-cache space unused meant there was less room for caching data pages, 
which could negatively affect your query performance. A good administrator could set 
the procedure cache percentage at a lower value, but the lowest possible value was 
one percent. On some of the biggest systems that SQL Server was running on at the 
time, with the largest amounts of possible memory, even one percent was too much 
memory, and SQL Server still ended up wasting memory. 

But as mentioned, SQL Server 7.0 changed things. Not only could SQL Server cache 
other kinds of plans besides procedures, but the methods of configuring and sizing 
memory changed completely. As of 7.0, the total size of your SQL Server system’s 
memory can be dynamically determined based on your needs, but within that total 
memory, SQL Server will use memory for purposes other than data buffers only when it 
needs the space. When SQL Server no longer needs space for other uses such as plans, 
user connections, and open objects, it will return the space to the buffer cache. 
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TABLE |: Determining Plan-Cache Pressure Limit 
in Different SQL Server Versions 


SQL Server Version 
SQL Server 2005 RTM and SP1 


Plan-Cache Pressure Limit 
.75 of visible target memory from 068-865 + .50 of visible 
target memory from 8068-6405 + .25 of visible target memory 


> 64GB 
SQL Server 2005 SP2 ‚75 of visible target memory from 0—4GB + .10 of visible target 
memory from 4GB—64GB + .05 of visible target memory > 64GB 
SQL Server 2000 SQL Server 2000 4GB upper cap on the plan cache 
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LISTING 2: Prepared Query Using a 


N'SELECT * FROM Sales WHERE ContactID > AID", 


LISTING 3: Prepared Query Using a 
Less-Restrictive Value 


N'SELECT * FROM Sales WHERE ContactID 


LISTING 4: Inspecting the Prepared 


SELECT usecounts as uses, sql FROM 


= db id('AdventureWorks') 
= 'Prepared' 


LISTING 5: Creating a Plan Guide 
for the Cached Plan 


8module or batch = NULL, 


EXEC sp create plan guide 
= N'RemovePlan', 
< N'SELECT * FROM Sales WHERE 
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the same plan, as you can 
see in Listing 3. To verify 
that SQL Server uses the 
same plan for both queries, 
you can inspect the plan 
cache, by running the code 
in Listing 4. When I run 
this code, I get the results 
that Figure 1 shows. 

If you realize that you 
don't want this plan in cache 
because you don't want to 
always use the plan for the 
selective value, you can 
create the plan guide that во 
Listing 5 shows. As soon as 
the plan guide is created, it 
will remove the plan for this 
query from cache, as you 
can verify by running the 
code in Listing 4 again. 

If you leave the plan 
guide in the database, the 
RECOMPILE hint will 
force SQL Server to always come up with a new plan for 
queries that have the same form as the query in the @stmt 
variable. If you only want to remove the existing plan, but 
not keep the plan guide around, you can remove the plan 
guide by using the following statement: 


Restrictive Value 


DBCC FREEPROCCACHE 
GO 
EXEC sp_executesql 


۱۱۱۵10 int', 
AID = 3 
60 


EXEC sp_executesql 


> 8ID', 
N'8ID int', 
210 - 200 


Plans in Plan Cache 


sys.syscacheobjects 
WHERE dbid 

AND objtype 
60 


EXEC sp_control_plan_guide 
N'drop', ۲ 


Кеер ап Еуе 

оп Сасһе 

Caching and reusing query 
plans can have both costs 
and benefits. If your plans 
arent cached and reused, 
SQL Server can spend far 


USE AdventureWorks 


aname 

astmt 
ContactID < م010۲‎ 

atype = ۱۷۲۹۵۲ 


@params << ۲ ۲ 


too much time optimizing ahints = N'OPTION(RECOMPILE)'; 
your queries—so you 

need to make 

sure you have uses sql 

enough mem- 2 (AID int)SELECT * FROM Sales 


Figure 1 


Results of plan-cache 
inspection 
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WHERE ContactID < аїр 


ory available 
for all your 
useful plans. If you cache and reuse plans inappropriately, 
you might not have the best plan for your queries, and 
their performance could suffer. Having too many plans 
in cache can waste both memory and CPU resources. 
Understanding how plan-cache size is determined and 
how to make sure plans are reused only when appropriate 
can help you better control your SQL Server system and 
make optimum use of all its resources. 500 
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can hurt overall system performance as much as help it. 
Another option is to clear only the plans from one data- 
base by using the command DBCC FLUSHPROCINDB 
(database_id). 

SQL Server 2008 will provide an option that lets you 
clear a single plan from cache. The details of exactly how 
this will be accomplished, and how you can specify which 
plan to remove, aren't yet available. However, you can use an 
existing feature in SOL Server 2005 to remove a single plan 
for plans that aren't ad hoc. (You can easily remove stored 
procedure plans by using sp recompile, or by altering the 
stored procedure definition, even if the ALTER doesnt 
change anything.) Prepared plans, whether they're created 
through autoparameterization, by using sp_executesql, or 
by using prepare and execute methods in your application, 
can be removed by creating a plan guide that matches the 
parameterized query, then dropping the plan guide. (For 
more information about plan guides, see "Controlling 
Parameterization," July 2007, InstantDoc ID 96349.) 

Let's look at an example that shows how to clear 
a single plan from cache in SQL Server 2005. First ГЇЇ 
create a copy of the Sales.SalesOrderHeader table in the 
AdventureWorks database and build a nonclustered, non- 
unique index on the new table, as Listing 1 shows. Next, 
ГЇЇ create a prepared query, which selects a very restrictive 
value that returns only a few rows, as Listing 2 shows. 
Then, ГЇЇ create a second query, which uses a much less 
restrictive value, but because it is a prepared query will use 


LISTING ۱: Creating a Table and Building 
an Index on It 


USE AdventureWorks 


GO 
SELECT * INTO Sales FROM Sales.SalesOrderHeader 
GO 
CREATE INDEX Contact indx on Sales(ContactID) 
GO 
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INDEXED VIEWS 


If you’re a SQL Server 2005 user, you’re prob- 
ably thinking, "I can just use table partitioning 
instead of indexed views.” When you were trained on 
SQL Server 2005, the instructor probably told you to 
replace SQL Server 2000 partitioned views with SQL 
Server 2005 partitioned tables. Partitioned tables phys- 
ically separate rows; partitioned views logically separate 
rows. Just as in “Supertypes and Subtypes” where we 
had disjointed and overlapping conditions, you might 
have rows in the dimension that participate in more 
than one role (i.e., an overlapping condition). If you 
ever have an overlapping condition, your best approach 
is to use indexed views, even in SQL Server 2005. 


Data Warehousing: 


DIMENSIONAL 
Role-Playing 


When a single dimension needs to be part of 
a fact table multiple times 


tially capture the retail events because OLTP tables are more 
highly normalized than the tables in Figure 1. Also, the 
join or associative table, which is the OLTP counterpart to 
the OLAP fact table, would typically have fewer attributes 
than this fact table. These tables are containers that hold 
the source data for the OLAP Services/Analysis Services 
multidimensional databases (cubes). Another name for a 
multidimensional database is the staging database or data 
store. The staging database receives verified and validated 
data via the extraction, transformation, and loading (ETL) 
processes from various external data sources, including 
OLTP databases. (This staging database is also an excellent 
reporting database because the veracity of its data content 
and the logical orderliness of its architecture make sense 
to business users. Business users armed with Microsoft 
Excel can build ad hoc reports without having to rely on 
overworked business intelligence (BI) programmers and 
analysts.) 

Take a look at the views in Figure 1, which you 
can identify because view names start with “v” (eg., 
vSeller). (Note that in Figure 1, dimension table names 
start with “dim” and fact table names start with "fact.") 
Гуе built views on each of the three dimensions, Time, 
Person, and Location. In this star schema, I’m linking the 
views rather than using the 
dimensions to link to the 
fact table. Now it's easy to 
visualize that each shipment 
has a buyer and a seller, a 
starting and an ending loca- 
tion, and multiple date and 
time events associated with 
each record in the fact table. 
The views (aka roles) serve 
as virtual dimensions. 

When creating a dimen- 
sional role, you can filter 
data. For example, if the 
company won't ship orders 


ole-playing isn't limited to the Broadway stage; 

data warehouse dimensions can also assume 

roles. Dimensional roles can be used not only 
for precision when you're defining the purpose or func- 
tion of a set of rows from a dimension, but also to easily 
maintain the data because there's only one table to manage 
instead of many tables to synchronize. 

Dimensional role-playing occurs when a single dimen- 
sion needs to be part of the same fact table many times. 
For example, dates and times appear in most types of 
analysis because business activities happen in a timeframe 
and objects exist in time. Time is almost always used when 
calculating and evaluating measures in a fact table. Each 
event constitutes a link to the time dimension. For example, 
a customer order was taken on a specific date; the order was 
picked and packed at the warehouse on one or more dates; 
the order was shipped to the customer on a date; the order 
has a delivery date; the order’s payment was received on a 
date; and sometimes, the order was returned on a date and 
a refund was issued on a date. АП of these dates need to be 
recorded, stored, and analyzed because they all play a role 
in the retail sale life cycle. 


Defining Roles 

When many same-type events, such as dates, are present in 
a single fact table, you have an accumulating snapshot fact 
table. (Of course, there are other types of fact tables, which 
ГЇЇ discuss in upcoming articles.) Figure 1 , page 32, shows 
the Shipping star schema. The fact table (1.е., factShip- 
ping) is an accumulating snapshot; it contains dates for 
multiple events, such as scheduled and actual departure 
and arrival dates and times, where each shipment starts 
and ends, and the buyer and the seller. We could record 
a lot more discreet date and time events in this fact table, 
but no others are necessary for this example. 

This star schema isn't a cube; it's a set of relational tables 
that can be managed and manipulated using SQL Server 
Enterprise Manager or SQL Server Management Studio 
(SSMS). Clearly, it isn't the OLTP tables that are used to ini- 
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lishes the subtype, which then ensures that only buyers are 
listed in the vBuyer dimensional view and only sellers are 
listed in the vSeller dimensional view. 

Each of the roles that these dimensions play in the 
accumulating snapshot fact table is a little different, 
and it’s important to relabel the view dimension and its 
attributes to better express the role meaning. I’ve done 
that in Figure 1: For example, the time dimension has 
morphed into vScheduledShipDate, and TimeKEY has 
become SchedShipDateKEY. There’s little chance that a 
business user would use vScheduledShipDate under the 
assumption that it represented arrival times. Remember, 
one characteristic of a data warehouse is ease of under- 
standing, and simple naming conventions make the roles 
easy to understand. 

The second characteristic of a data warehouse is 
performance. As anyone who works in data warehousing 
will tell you, proper indexing is crucial to good perfor- 
mance, so you should consider indexing views. Because 
of the simplicity of the dimensional table structure—a 
surrogate key, a flattened set of attributes—the dimen- 
sions lend themselves nicely to serving as base tables for 
indexed views. Also, the indexed view is dynamically 
updated as the base table changes, so you only have to 
manage the dimension table when you're executing ETL 


operations. 

۳3 Dimensional 
i k> identity = à 

uus seu | Role-Playing 


bit 

smallint 
tinyint 
tinyint 
smallint 
nvarchar(10) 
nvarchar(10) 
tinyint 
tinyint 
char(4) 
tinyint 
char(4) 
tinyint 
tinyint 
smallint 


not null 
null 
null 
null 
null 
null 
null 
null 
null 
null 
null 
null 
null 
null 
null 


If you're still having trouble under- 
standing how dimensional role- 
playing works, visualize the last 
time you flew in an airplane. There 
was a scheduled departure date/ 
time, an actual departure date/time, 
a scheduled arrival date/time, and 
an actual arrival date/time. Then 
there was the time on the ground 
at the point of departure (calcu- 
lated as the time interval between 
when the jet pulled away from 
the gate until the wheels were off 
the ground), the time spent in the 
air traveling from the departure 
airport to the arrival airport, and 
the time on the ground at the 
point of arrival (calculated as the 
time interval between touch down 
and when the jet docked at the 
gate). These times are critical for 
conventional airline flight analysis, 
and each would have an entry in 
a fact table that would be used 
to calculate airline flight on-time 
percentages. Б 


InstantDoc ID 97272 


dimPERSON 
numeric 
nvarchar(20) 
int 
nvarchar(50) 


<pk> identity 
null 
null 
null 


dimLOCATION 

numeric identity 
nvarchar(20) null 

int null 
nvarchar(50) null 


<pk> 
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on weekends or holidays, it's easy to constrain the shipping 
date views with the following selection criteria: 


select 

from dimTIME 

where DayNumberOfWeek between 2 and 6 
and Holiday = 0 


Or you can separate buyers from all other people when cre- 
ating the vBuyer view by running the following command: 


create view vBuyer 
AS 

select 

from dimPerson 
where PersonType 


'buyer' 


Editor's Tip 


Michelle wants 
your input on 
this series! 

Join the forum 
discussion 21 ۵ 


.com/go/dbdesign. 


—Megan Bearly, 


Supertypes and Subtypes 

If something seems familiar about this discussion of 
dimensional roles, the reason might be that Гуе discussed 
the concepts of supertypes and subtypes in previous col- 
umns. Dimensional role-playing combines data subtyping 
and filtering into one operation. For example, in Figure 
1, dimPerson is a supertype and vBuyer and vSeller are 
virtual subtype entities. Using the WHERE criterion, the 
WHERE clause of the query that creates the subsets of 
data for each of the views acts as a data filter and estab- 


vScheduledShipDate 


SchedShipDateKEY 
FullDate 

DayName 
MonthName 
CalendarQuarter 
CalendarYear 
FiscalQuarter 


dimPRODUCT 

numeric <pk> identity 
nvarchar(20) «ak» not null 
nvarchar(50) null 
money null 
money null 
nvarchar(50) null 
nvarchar(50) null 
nvarchar(50) null 


TimeKEY 

FullDate 

Holiday 
DayNumberofDay 
DayNumberofWeek 
DayNumberofMonth 
DayNumberofYear 
DayName 


(۱ 
ProductCode 
ProductName 
ListPrice 
StandardCost 
Color 

Size 
SizeRange 


3 Weight float null MonthName 
FiscalYear WeightUnitMeasure nvarchar(50) null MonthNumberofYear 
dimTIME CalendarQuarter 


CalendarYear 
FiscalQuarter 
FiscalYear 
AcademicQuarter 
AcademicSemester 


factSHIPPING AcademicYear 


numeric 
numeric 
numeric 
numeric 
numeric 
numeric 
datetime 
numeric 
datetime 
numeric 
datetime 
numeric 
datetime 
numeric 
int 

int 
smallint 


OX shippingstatusKEY 
BuyerKEY 
SellerKEY 

«2 ShipFromLocationKEY 
ShipToLocationKEY 

ZZ] SchedShipDateKEY 

NJ SchedShipTime 
SchedArriveDateKEY 
SchedArriveTime 
ActualShipDateKEY 
«2 ActualShipTime 

O< ictualtriveDateKEY 

ActualArriveTime 

ProductKEY 

NbrCasesShipped 

NbrUnitsShipped 

NbrUnitsPerCase 


<pk> i 


PersonKEY 
PersonType. 
PersoniD 
PersonName 


امس 


LocationKEY 
LocationType 
LocationlD 
LocationName 


ShipToLocationKEY 
LocationType 
LocationlD 
LocationName 


dimLOCATION 
vShipFromLocation 


ShipFromLocationKEY 
LocationType 
LocationlD 
LocationName 


dimLOCATION 


VSeller 


SellerKEY 
PersonType 
PersonlD 
PersonName 

Е dimPERSON 


BuyerKEY 
PersonType 
PersonlD 
PersonName 


dimPERSON 


assistant editor 


vScheduledArriveDate 


SchedShipDateKEY 
FullDate 

DayName 
MonthName 
CalendarQuarter 
CalendarYear 
FiscalQuarter 
FiscalYear 
dimTIME 


vActualShipDate 


SchedShipDateKEY 
FullDate 

DayName 
MonthName 
CalendarQuarter 
CalendarYear 
FiscalQuarter 
FiscalYear 


dimTIME 


vActualArriveDate 


SchedShipDateKEY 
FullDate 

DayName 
MonthName 
CalendarQuarter 
CalendarYear 
FiscalQuarter 
FiscalYear 
dimTIME 


Figure | 


Views in the Shipping star schema 


32 December 2007 


РА 


Alan Sugano 
Alan Sugano (asugano@adscon.com) is the 


president of ADS Consulting Group, which 
specializes in networking, custom program- 
ming, Microsoft .NET Web development, and 


SQL Server development. 


December 2007 33 


RUNNING SQL Server 


ina VIRT WZ 


ENVIRONMENT 


Optimize server performance 
with these virtualization tips 


often goes unused. Using one physical server per SQL 
Server installation significantly increases your hardware 
costs, and if each physical server is placed on a service 
contract, your maintenance costs are also higher. And 
the greater number of physical servers consumes more 
electricity and requires more physical space than a vir- 
tual server setup does. 

Virtual server advantages. By consolidating several 
virtual server guests onto a single physical server, you 
can save a significant amount of hardware cost and 
provide better utilization of hardware resources. When 
determining where a virtual guest should reside, pick a 
physical server that hosts complementary guests. For 
example, if a SQL Server installation will place a signifi- 
cant amount of I/O stress on the processor, run it on a 
physical server that hosts other virtual guests that don't 
require the same type of I/O processing. 

Virtual servers can lower your hardware maintenance 
costs because you'll have fewer physical servers to main- 
tain, as well as lower electric bills and power require- 
ments. With the cost savings, you might be able to place 
all your host servers on a 24x7 support contract compared 
with a 8x5 support contract if you're not using virtual 
servers. Even with this increased service level, you'll likely 
have a net savings for hardware maintenance. 

Virtual servers also make bare metal restores easier. 
Every virtual server hardware configuration is consistent 
regardless of the host's physical hardware configuration. 
Therefore, you simply copy the virtual server guest files 
to another physical host. 

Virtual server disadvantages. With VMware's VMware 
Server and Microsoft Virtual Server 2005, you can allo- 
cate a maximum of approximately 3.8GB of memory to 
each virtual server guest. With VMware ESX Server, 
you can allocate a maximum of 16GB of memory to 
a virtual server guest. If your SQL Server deployment 
requires more memory than these limitations, the server 
is probably not a good candidate for virtualization. 

Because all virtual server guests must share the host 


erver virtualization is a hot topic. Most of the new 

server installations I perform for my consulting 

clients have some type of virtualization. SQL 
Server is unique because of the potential processor, disk, 
and memory demands it can place on hardware. SQL 
Server is one of the few Microsoft server applications 
that has the potential to place a significant load on the 
processor, especially if your application deals with large 
tables or high transaction volume. Therefore, running 
SQL Server in a virtual environment is an option many 
enterprises will want to consider. 


Physical vs. Virtual 

Most SQL Server installations don't use 100 percent 
of a server's resources all the time. In fact, many SQL 
Server applications use only a percentage of a server’s 
processing potential. SQL Server processor utilization 
tends to be high for short periods of time. The current 
version of servers that are available with quad-core 
processors and significantly faster Serial Attached SCSI 
(SAS) drives capable of 3Gbps transfer rates represent 
a quantum leap in server performance. Virtualization 
can take advantage of the processing power of this new 
generation of servers, providing significant leverage for 
your hardware dollar. 

Physical server advantages. Traditionally, SQL Servers 
run on dedicated hardware. This gives the server full 
access to all the processing power on the server. Running 
SQL Server on a physical server means that the server 
doesn't need to compete with other virtual servers for 
resources, and a physical server will perform approxi- 
mately 10 percent to 25 percent faster than a virtualized 
server on the same hardware. SQL Server 2005 Standard 
and Enterprise versions can address the maximum 
amount of memory that the server and OS will support. 
Also, a hardware failure on the server will affect only one 
SQL Server installation. 

Physical server disadvantages. A physical server can 
waste resources because the server’s processing power 
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—INFRASTRUCTURE LOG 


_DAY 69: All we need is one specific piece of info. Gil 
almost had it, but his hand cramped. How are we supposed 
to find trusted business info when these massive volumes 
of disparate, conflicting information keep pouring in? 


_Gil just grabbed a stuffed panda. 


_DAY 71: The answer: IBM solutions for leveraging information. 
Now we can cleanse info and standardize source data fields 
for consistency and accuracy. I can create a single, 
comprehensive and accurate record of info across our source 
systems. Finally, I can provide a unified, trusted view of 
our information so everyone can make better decisions. 


_Our view of our data is now scalable. Just in time — I think 
we ran out of quarters. 


Information Management Download the Innovation and Competitive Advantage white paper: 
g IBM.COM/TAKEBACKCONTROL/ACCURATE 
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Drive.vmdk” if you're using VMware Server; “SQL1 
C Drive.vhd” if you’re using the Microsoft product). 
3. Create other virtual server disk files as necessary. 
Often you want the server to have a D drive or other 
drives to store data, depending on the server’s role. 
4. Create a new virtual server guest on the host. 
5. When prompted for the disk information, select 
the virtual server disk image you just copied and 
renamed. 
6. Start the virtual server guest. 
7. Set the IP address on the server to a static address. 
8. Rename the virtual server guest to its permanent 
name. 
9. Join the virtual server guest to the domain. 


If you've properly created your base images, it should 
take you about 15 minutes to bring up a new server. You 
can then install SQL Server or other applications you 
might need. Ideally, you should place these images on a 
SAN or other file server share so that all new hosts have 
convenient access to them. Consider using ISO images 
of the installation CD-ROMs of SQL Server (and other 
applications) to speed up the installation of new servers. 
Properly configured base images can save you an hour 
or more of set up time per server—and much more if you 
consider any required hardware installation. 


Hardware Configuration Tips 

The latest generation of servers, with quad-core proces- 
sors and SAS hard drives, represents a quantum leap 
forward in processing power, which makes them ideally 
suited for virtual server hosts. Use the following virtual 
server host hardware configuration tips to get the best 
performance from your virtual server guests. 

Host server memory. Install adequate memory for 
your host server and virtual server guests. Most of our 
virtual server host servers start out with at least 8GB of 
memory. Make sure that the host OS has the capability 
of addressing all the physical memory in the server. The 
standard 32-bit version of Windows 2003 can address 
only 4GB of memory, but the standard x64 version of 
Window 2003 can address 32GB of memory. 

Host OS. If you plan to use Windows Server as the 
host OS, consider installing the x64 version of Windows 
2003. Windows 2003 Standard x64 supports 32GB of 
memory and Windows 2003 Enterprise x64 supports 
1TB of memory. The x64 versions are mandatory if you 
want to run any x64 virtual server guests, but even 32- 
bit guests will realize some performance gain by using 
Windows 2003 x64 as the host OS. Today, only VMware 
Server and ESX Server support x64 guests; Virtual 
Server 2005 supports only 32-bit virtual server guests. 

Processor. Use quad-core processors for the best 
virtual server performance. Quad-core processors are 
typically about 50 percent faster than the equivalent 
dual-core processor. If you plan to run any x64 virtual 
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server's physical hardware resources, if too many guests 
are running on a host server, performance will suffer. 
Avoid placing too many guests on one host or guests 
that require concurrent hardware resources. 

Fault tolerance is another virtual server disadvantage. 
A hardware failure will take down all guests running on 
the host. However, if you use VMware Server or Virtual 
Server 2005 and you have 
a backup of each virtual 
server's guest file, you can 
restore these files onto a 
different host and quickly 
bring up the failed guests. 


Virtual Guest 
Placement 

When planning the place- 
ment of your virtual 
servers, try to select vir- 
tual server guests that have 
hardware resource require- 
ments that are complemen- 
tary to each other. For 
example, a SQL Server 
application that contains 
a large number of stored 
procedures and triggers 
with a relatively small data- 
base with light transaction 
volume will have higher processor requirements, but 
relatively light disk requirements. This SQL Server 
instance could be run in the same host as a file and 
print server that stresses the processor very little but 
may require high disk performance to serve files. Take 
into consideration how many users will access each 
virtual server and how these servers are stressed during 
the day. For fault tolerance, don't place all of your most 
important servers on a single host. If your server envi- 
ronment is complex (1000+ servers) you might want to 
consider one of the server metric and design tools such 
as CIRBA (www.cirba.com/product/product-overview 
.htm) This tool helps automate the process of server 
placement, host design, and performance tuning of vir- 
tual server guests and hosts. 


Creating Virtual Server Guests 

I suggest that you create a base image for each type of 

server guest you plan to use. The “Creating a Virtual 

Server Base Image,” checklist shows the basic guidelines 

Ifollow when building a virtual server base image. After 

you create the base image, you can create a new virtual 

server guest by following these steps: 

1. Copy the virtual server disk image to a different 
folder. 

2. Rename the virtual server disk file to a different 
name than the base image disk file (e.g., "SQLI C 


CREATING A VIRTUAL 
SERVER BASE IMAGE 
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TABLE |: Disk Configuration for а 
Virtual Server Host that Will Run a 


RAID 1 Host and guest OS files 
Virtual guest SQL Server transaction log files 
Virtual guest SQL Server data files 
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the SQL Server with either differential or incremental 
hourly log dumps to tape or disk during the business 
day. But one of the main advantages of a virtualized 
SQL Server is the ability to obtain an image backup of 
the SQL Server by simply backing up the virtual guest 
files on the host server. If you’re running ESX Server 
you can perform a backup of any virtual server guest 
by using VMware’s Consolidated backup or Vizoncore’s 
esxRanger software while it is still running. 
Ifyou’rerunning VirtualServer2005 or VMwareServer, 
you must shut down the virtual server guest before youcan 
obtain a backup of it. Fortunately, you can write a script 
to automate the process of taking down the virtual server 
guests, running a backup of the host, then restarting the 
virtual server guests. I usually configure the backup to 
obtain an image backup of all virtual server guests once a 
week.You can find more information about virtual server 
backup strategies at www.windowsitpro.com/Article/ 


ArticleID/95309/95309.html. 


Disaster Recovery 

In a disaster recovery scenario, you can replicate these 
virtual server image files to a remote disaster recovery 
location. You could use the built-in feature of Windows 
2003 R2’s DFS Replication (DFSR) to automatically 
replicate these files to a remote location. From week to 
week, the virtual server image files don't change much 
so these types of files work well with DFSR because 
they can take advantage of the Cross-File Replication 
and Remote Differential Compression (RDC) features 
included with DFSR. The Cross-File Replication feature 
uses information from local files to duplicate files in 
a remote location. The 
RDC feature will replicate 
only changes to an existing 
file, greatly speeding up 
the replication process. 
So even though virtual 
server guest files tend to 
grow rather large, you can RAID 1 
replicate them relatively RAID 10 
quickly by using DFSR 
technology. 

Even with the image backup, I still use the traditional 
back up method so that I can perform a granular resto- 
ration of files or databases. Although the image backup 
is great for moving a virtual server from one host to 
another host, it's not the best solution when you want to 
restore a single file or portion of a transaction log. 


SQL Server Guest 
Disk RAID Array Files 


A Virtual Test Drive 

Running virtualized SQL Servers definitely has a 

learning curve, but the benefits are tremendous. Start off 

slow, with perhaps a test server or low profile SQL Server 

until you become familiar with the technology. Ea 
InstantDoc ID 95799 


server guests, make sure the processor is capable of x64 
processor virtualization. Processors that support x64 
virtual guests include AMD Athlon 64, revision D or 
later; AMD Opteron, revision E or later; AMD Turion 
64, revision E or later; AMD Sempron 64-bit capable 
version, revision D or later (experimental support); and 
Intel Extended Memory 64 Technology (EM64T) Vir- 
tual Technology (VT)-capable processors (experimental 
support). If you're using Intel processors, you'll need to 
enable the virtualization of the processor in the BIOS 
of the host server before you can install any x64 virtual 
server guest of the host server before you can install any 
x64 virtual server guest. 

Host hard drive configuration. The same basic disk 
configuration rules apply for a virtualized SQL Server 
system as for running SQL Server on a physical server. 
To configure a host to run SQL Server, set up the hard 
drives with at least one RAID 1 array for the SQL Server 
transaction logs and either a RAID 5 or RAID 10 array 
for the data portion of the SQL Server database. For 
larger SQL Server installations, it's a good idea to sepa- 
rate the transaction log and the data portion on separate 
RAID arrays. SQL Server transaction logs are written 
to sequentially and perform best when they reside on 
a RAID 1 array, which gives the best performance for 
sequentially written files. RAID 5 or RAID 10 arrays 
give the best performance for randomly accessed files 
and are a good choice for the data portion of any SQL 
Server database. For the best performance, run the x64 
bit version of SQL Server 2005. If you anticipate heavy 
disk activity from your virtual server guests, consider the 
host server disk configuration that Table 1 shows. Using 
this configuration, the host server might have 10 physical 
hard drives—two drives for the first RAID 1 array, two 
drives for the second RAID 1 array, and six drives for the 
RAID 10 array. Use SAS drives in the server and avoid 
using Serial ATA (SATA) or IDE drives. 

Virtual server guest add-ins (i.e., VMware Tools or 
Microsoft Virtual Machine Additions). Installing these 
add-ins should be one of the first steps you complete 
after the virtual server OS installation. Doing so will 
significantly improve the overall performance of any 
virtual server guest. 

SCSI drives for the virtual server guests. When you 
configure the disk type of the virtual server, always 
use SCSI disks. This is independent of the type of hard 
drives you're using on the host. 

Virus scanning. On the host server, disable virus scan- 
ning of all virtual server guest files. 


Backup Strategies 

Backing up virtualized SQL Server implementations 
presents some unique challenges and benefits compared 
with backing up SQL Server servers that are running 
on a physical server. With mission critical SQL Server 
applications, I typically perform a full daily backup of 
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Working with 
Estimated 


QUERY EXECUTION 


Estimating the complexity and size of query 
result sets їп SSMS and client applications 


SELECT ST.Name AS TerritoryName, 
SP.SalesPersonID, 
SP.CommissionPct 

FROM Sales.SalesTerritory 
ST INNER JOIN 
Sales.SalesPerson SP ON 
SP.TerritoryID - ST.TerritoryID 

ORDER BY ST.Name; 


You can download this query along with the rest of the 
sample code at InstantDoc ID 97267. This query is in the 
Showplan_Sample1.sq] file. 

You can see the result set and execution plan for this 
query in Figure 1 and Figure 2, respectively. Although an 
in-depth discussion of understanding execution plans is 
out of this article's scope, you can find more information 
about execution plans by reading “Optimizing a Subop- 
timal Query Plan," February 2007, InstantDoc ID 94775. 


However, let me 
ORE on the WEB 


point out a few 
basics. First, an Download the listing at 
execution plan is InstantDoc ID 97267. 
returned for each 
query submitted in a query batch. (In our example, we 
submitted only one query.) An execution plan is read from 
top to bottom, then right to left. For example, the first 
step in this execution plan is to perform an Index Scan 
of the AK SalesTerritory Name index to return the Ter- 
ritory name and ID columns. The results of this step are 
combined with the next step (a Cluster Index Scan against 
the SalesPerson clustered index). Then, working right to 
left, a nested loop is used to join the two tables together. 
Finally, the results of this nested loop are fed into the final 
SELECT operation. 

Each step lists a cost value, which represents the cost 
of a step relative to the entire query. In our example, the 
AK SalesTerritory Name Index Scan represents 44 per- 
cent of the total query cost. Similarly, if you run a batch 
of queries, you'll see the query cost relative to the entire 
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few months ago, one of my customers began 
A experiencing performance problems with their 

new reporting environment. After rolling out 
an ad hoc reporting tool, some of the business users 
began building reports that were very complex and, in 
some cases, resulted in tens of thousands of rows being 
returned. My customer wanted to prevent these types of 
reports from being executed. With this requirement in 
mind, I began investigating whether SQL Server execution 
plans could provide a solution. 

Let's take a look at how execution plans can be used 
to estimate both the complexity of queries and the size of 
query result sets. ГЇЇ first show you how to use execution 
plans within SQL Server Management Studio (SSMS), 
then show you how execution plans can be used from a 
client application. 


Execution Plan 


SalesPersonlD | CommissionPct Overview 
0.018 When you run a query 
0.01 against the relational data- 
0.015 base engine, SQL Server 
Т 015 carries out two major steps. 
TT The first step is to com- 
- pile the query into a plan. 
0.018 : 
2013 The second step involves 
ani executing the plan. SQL 
- Server 2005 can return a 
0.012 ۱ з 
query execution plan іп 
0.019 three formats: text, XML, 
0.01 and graphical. The easiest 
p.015 way to look at an execution 
0.01 plan is to view its graphical 
0.02 representation. With SQL 


Server 2005, you use SSMS 
to do so. For example, after opening a new query window 
in SSMS, select Query, Include Actual Execution Plan 
and run the following query against the AdventureWorks 
sample database: 


2 Results | Fe Messages IS Execution plan | 


FEATURE 


Tyler Chessman 
tylerc(Q) microsoft.com) is a Microsoft technology 


specialist who helps Microsofts database and 
business intelligence platform customers. Before 
joining Microsoft, he spent several years as a 
consultant implementing financial, analytical, and 
sales-force—automation solutions. 
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figuration.” This estimate is based on a test machine that Figure 2 


the SQL Server developers used in their lab. Therefore, this 
cost value is a loose estimate. It’s important to note that the 
actual execution plan cost can vary greatly depending on 
your environment at the time the query is running. 

I encourage you to run several different queries on 
your machine and examine the estimated values versus the 
actual values, as well as the actual time it takes to run the 
query. You'll then have a better idea of what type of cost 
threshold is realistic for your environment. 


Step details for the АК_ 
SalesTerritory Name 
Index Scan 
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Figure 3 


Using Estimated Execution Plans > çrırcr рсете 


іп an Application 

You should now have an understanding of how execution 
plans work and how to view the graphical form of an 
execution plan within SSMS. Now ГЇЇ walk you through 
how to use an execution plan within an application. I 
mentioned that execution plans can be returned in text, 
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For more information about statistics 
“Making the Most of Automatic Statistics 
Updating,” InstantDoc ID 96767 


For more information about execution plans 
“Optimizing a Suboptimal Query Plan,” 
InstantDoc ID 94775 


For more information about Report Builder 
“No-Nonsense Reporting Tool,” InstantDoc 
ID 50032 
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query batch listed inside the top of each execution plan. (ГЇЇ 
discuss more about what this cost value actually measures 
later.) Connecting each step is an arrow, whose thickness 
denotes the relative number of rows coming from each step. 
As you can imagine, information about each step in a plan 
can be very useful when trying to troubleshoot and optimize 
a query. 

Note that you can view additional details for any step 
in an execution plan by selecting or hovering over the step 
with your cursor. In Figure 2, you can see the step details 
for the AK_SalesTerritory_Name Index Scan. Several 
columns of information are displayed here. Note that 
if you want to see an explanation of each column, click 
a step, then make sure the Properties window is visible 
within SSMS. To make this window visible, select View, 
Properties Window in SSMS. Note that many estimated 
values (e.g., Estimated CPU Cost) are displayed, as well 
as several actual values. Estimated values show what the 
query engine expects to happen, which leads us into a 
discussion of estimated execution plans. 


Estimated Execution Plans 

To view an estimated execution plan, select Query, then 
choose Display Estimated Execution Plan in SSMS. Unlike 
an actual plan, the query isn’t run; instead, only an esti- 
mated plan is returned. For large, complex queries, you'll 
notice the estimated plan is returned very quickly (relative 
to running the actual query and viewing the actual plan). 
Also, when you look at the step details of an estimated 
plan, you'll see only estimated values (1.е., none of the actual 
values associated with actual plans are returned). 

As you can imagine, estimated values might not always 
match the actual values because several conditions, such as 
the cost of applying these filters and the number of rows 
returned, won't be known until after the query is executed. 
The query engine calculates these estimated values based 
on statistics (SQL Server maintains statistical information 
about indexes and column data stored in the database), so 
the accuracy of the estimates can be affected if the statis- 
tics are out of date. For more information about statistics, 
see "Making the Most of Automatic Statistics Updating," 
October 2007, InstantDoc ID 96767. 

To help meet my customer's requirement of identifying 
complex queries and/or queries that return large numbers 
of rows, we're going to focus on two estimated values: Esti- 
mated Number of Rows and Estimated Subtree Cost. The 
Estimated Number of Rows value is self-explanatory. Esti- 
mated Subtree Cost is defined as the estimated cumulative 
cost of an operation and all child operations. In Figure 3, 
Гуе highlighted the SELECT operation, which is the final 
step of our query. The estimated number of rows for this 
query is 14. The estimated subtree cost is 0.0075236. But 
how is this cost measured? 

According to SQL Server 2005 Books Online ( BOL), 
cost is measured as the "estimated elapsed time, in seconds, 
required to complete a query on a specific hardware con- 
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E i ESTIMATED QUERY EXECUTION PLANS 


or a query batch. There are also two text boxes 
near the bottom left side where you can enter 
values to represent the maximum desired cost 
and estimated rows for a query batch. When you 
click Run, the code shown in Web Listing 1 (www_ 
.sqlmag.com, InstantDoc ID 97267) is executed. 
This code establishes a connection with the database, 
then executes the SET SHOWPLAN XML ON 
command. Next, the code passes the query batch to 
SQL Server. SQL Server returns an XML version of 
the estimated execution plan. The plan is returned in 
a SQLDataReader object with one column of data, 
so the code iterates through the SQLDataReader object 
to concatenate the plan back into a single string. After 
executing the SET SHOWPLAN XML OFF command, 
the code uses XML Path Language (XPath) to parse the 
string and calculate the total cost and total estimated rows 
for the query batch. (Note that a user must be granted the 
SHOWPLAN permission, which is a database-level per- 
mission, to generate an XML showplan.) These values are 
displayed in the two text boxes near the bottom right side 
of the form and are compared to the maximum desired 
values. The status text box at the very bottom of the form 
indicates when one of the estimated values exceeds the 
corresponding maximum desired value. Although all of 
the code resides within the client application, you could 
alternatively use a SQL Server CLR stored procedure to 
house this logic in an instance of SQL Server. 

The sample application simply lets you examine the 
estimated cost and row count of a query batch, and 
compare those values to a maximum desired set of values. 
However, in your application, you could use this logic to 
examine and potentially prevent complex queries from 
running. 


A Custom Data Processing 
Extension for SSRS 

The customer I mentioned at the beginning of this article 
was using SQL Server 2005 Reporting Services (SSRS) 
and rolled out Report Builder for ad hoc reporting. (For 
more information about Report Builder, see my article 
*No-Nonsense Reporting Tool," June 2006, InstantDoc 
ID 50032.) To leverage estimated plans in SSRS, a custom 
data processing extension must be built. In an upcoming 
article, I discuss how I built this extension. 


Prevent Problematic 
Queries from Running 
Execution plans are a great tool for developers and DBAs 
alike. Typically, they're used to better understand, trouble- 
shoot, and tune queries. However, I also think execution 
plans have the potential to help prevent problematic que- 
ries from ever running. This kind of “proactive avoidance” 
can be especially useful in ad hoc reporting environments. 
500] 
InstantDoc ID 97267 
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FROM Production Product P INNER JOIN 
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This query E stmated Row Count exceeds the Mum Rowe Count 
Figure 4 . 

XML, and graphical format. There are two types of 
text plans: one that's generated using the SET SHOW- 
PLAN TEXT command and one that's generated using 
the SET SHOWPLAN ALL command. SET SHOW- 
PLAN TEXT returns a limited amount of information, 
which doesn't include the estimated values we need. Using 
SET SHOWPLAN ALL returns estimated values in a 
tabular format. It's а possible candidate for consump- 
tion in an application. However, the XML version of a 
plan provides the richest level of detail (graphical plans 
are actually built from the XML version of a plan) and 
produces a single XML document for the query batch. 
There are existing examples of extracting data from an 
XML plan, which we can use as a starting point. In 
particular, I have developed samples using some of the 
code found in the Microsoft article "Processing XML 
Showplans Using SQLCLR in SQL Server 2005" (msdn2 
microsoft.com/en-us/library/ms345130.aspx). 

In my sample project, EstimatedExecutionPlan_ 
WinApp.vbproj, I have built a .NET Windows applica- 
tion that passes a query batch to a SQL Server 2005 
database, which you can see in Figure 4. Let's briefly 
walk through the application logic. In the top part of the 
main window, there's a text box where you can enter a 
connection string for a SQL Server 2005 database. The 
large text box in the middle is used to specify a query 


Passing a query batch 
to a SQL Server 2005 
database 


ESTIMATED EXECUTION PLANS’ 
SHORTCOMINGS 


During my testing, | discovered that estimated execution plans aren’t always gener- 
ated. The SQL Server 2005 Books Online (BOL) topic “Transact-SQL Statements 
That Produce Showplans” has a complete list of query statements that don’t gen- 
erate a plan, but what | found disappointing was the fact that statements using 

the sp_executesq| stored procedure don’t generate an estimated plan. (I expect 

sp executesql might show up in ad hoc reporting applications.) BOL also states 

that dynamic SQL statements don't generate estimated plans. Just to be clear, in 
this case, dynamic SQL statements means SQL statements in the form EXEC @ 

SQL. Statement, where the variable GSQL Statement contains the T-SQL query. You 
can certainly extract the query from the EXEC statement variable and pass it to SQL 
Server to generate an estimated plan. | imagine you could also develop code to parse 
statements that use sp executesql to generate an estimated plan. However, the fact 
remains that you're not always guaranteed an estimated plan. So if you decide to use 
estimated plans, make sure you incorporate error handling to account for situations 


where estimated plans are missing. 
InstantDoc ID 97268 
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Sharpen Your Basic 
SQL Server Skills 


Brush up on FILLFACTOR, ISNULL, NULLIF 
and % as wildcard and operator 


ISNULL to compare a value with NULL. You can use 
CASE instead of NULLIF or ISNULL. However, using 
NULLIF or ISNULL reduces the amount of T-SQL 
script you need to write. Only a negligible performance 
difference exists between using NULLIF or ISNULL, 
or using CASE. Listing 3 demonstrates NULLIF and 
ISNULL, comparing each with an equivalent CASE 
statement. (Notice how much more code CASE requires.) 
Run the script in Listing 3 two times with two different 
values. The first time you run the script, set @expression2 
to 0, and the second time set @expression?2 to 1. 


o 


T 


Pinalkumar Dave 
pinal @ sqlauthority.com) is a database 


administrator and project manager. He writes 
regularly about SQL Server technology at 


www.sglauthority.com. 


ORE on the WEB 


Download the code at 
InstantDoc ID 97337. 


О: What's the difference between % used as an 
operator and % used as a wildcard? 
A: When % is used as an operator, the modulo 
returns the remainder of a division operation. Using 
% as a wildcard character matches any length of the 
string or number in a WHERE clause. Listing 4 contains a 
query that uses % as modulo and as wildcard. En 
InstantDoc 10 7 


LISTING ۱: NULLIF Script 


SELECT NULLIF(1,1) TestEqualNULLIF, NULLIF(1,0) TestNotEqualNULLIF; 


LISTING 2: ISNULL Script 


SELECT ISNULL(NULL,1) TestNULLValue, ISNULL(10,1) TestNotNULLValue; 


LISTING 3: NULLIF and ISNULL Compared with CASE 


DECLARE gexpression1 INT; 
DECLARE gexpression2 INT; 
SET 8expression1 5 

SET 2 


» 


0; 


--Example of NULLIF 

SELECT NULLIF(8expression1,gexpression2) Result; 

SELECT CASE WHEN дехргеѕѕіоп1 = дехргеѕѕіоп2 THEN NULL ELSE @expression1 
END Result; 


--Example of ISNULL 

SELECT ISNULL(@expression1,dexpression2) Result; 

SELECT CASE WHEN дехргеѕѕіоп1 IS NULL THEN @expression2 ELSE @expression1 
END Result; 


LISTING 4: % as Modulo and Wildcard 


USE AdventureWorks; 
GO 
SELECT CAST((UnitPrice) AS INT) X OrderGty AS Modulo, 
CarrierTrackingNumber, CAST((UnitPrice) AS INT) UnitPrice, 
OrderGty 
FROM Sales.SalesOrderDetail 
WHERE CarrierTrackingNumber LIKE '49117' 
GO 
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his month Гуе been hearing from new DBAs who've 

asked about the difference between FILLFACTOR = 
0 and FILLFACTOR - 100 when creating or rebuilding an 
index and the difference between ISNULL and NULLIF 
(and how they each compare with CASE). You also want to 
know how % used as a wildcard differs from % used as an 
operator. ГЇЇ answer these top questions on the basics and give 
you some sample scripts. If you can't wait a month to catch 
up on SQL Server tips and information, check out my blog at 
www.sqlmag.com/go/SQLskills. 


Q: What’s the difference between using FILLFACTOR = 
0 and FILLFACTOR = 100 when creating or rebuilding 
an index? 

A: FILLFACTOR = 0 and FILLFACTOR = 100 are 
the same in all respects. FILLFACTOR, a percentage 
measurement, determines how much data is filled on 
the leaf level of each index page when an index is cre- 
ated or rebuilt. The minimum value for FILLFACTOR 
is 0, and the maximum value is 100. The default is 
FILLFACTOR = 0. Once modified from 0 to any other 
value, the FILLFACTOR value can’t be set again. Set- 
ting FILLFACTOR to 100 has the same effect as setting 
it to the default of 0, because both FILLFACTOR = 
0 and FILLFACTOR = 100 tell SQL Server to com- 
pletely fill the leaf page. No space is unused. Using 
a value between 1 and 99 always leaves a percentage 
of the leaf page space unused. For example, FILL- 
FACTOR = 80 leaves 20 percent of the space unused. 


Q: What’s the difference between NULLIF and ISNULL? 
A: The difference between NULLIF and ISNULL is that 
NULLIF returns NULL, whereas ISNULL replaces 
NULL. NULLIF checks for certain parameters and 
returns NULL if the parameters are equal; otherwise it 
returns the first parameter. Listing 1 contains a NULLIF 
script that returns NULL for TestEqualNULLIF and 0 
for TestNotEqualNULLIF. ISNULL checks for the value 
of a parameter and replaces NULL with another value 
if the parameter's value is NULL. Listing 2 contains an 
ISNULL script that returns 1 for TestNULLValue and 10 
for TestNotNULLValue. 


Q: What's the typical usage of NULLIF and ISNULL? 
A: Use NULLIF to compare two values, and use 
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Bytes from the Blog 
www.sqlmag.com/go/industrybytes 


smart enough to turn off compression when it encounters 
uncompressible data." 


Virtualization and SQL Server 
Virtualization is the hottest topic at IT gatherings these 
days and PASS was no exception. One of the best- 
attended sessions at PASS was an expert panel discussion 
called Virtualization and SQL Server in the Enterprise. 
Quest Software hosted the panel, and the speakers 
included PASS President (and Quest's Director of Tech- 
nology for the SQL Server Solutions Group) Kevin Kline; 
Matt Hollingworth (a Microsoft SQL Server senior pro- 
gram manager); Allan Hirt (consultant and author); Ron 
Talmage (cofounder of Solid Quality Mentors); Scott 
Herold (vice president of Product Engineering at Invirtus); 
Darren Bieniek (global practice manager for Microsoft 
SQL Server at EMC Corporation); and John Paul Cook 
(a database and system architect). 

Most of the discussion revolved around the best way 
to use SQL Server for virtualization. Participants agreed 
that virtualization was a good choice for backup, disaster 
recovery, testing non-critical systems, and development 
testing. 

All the panelists had reservations about using virtu- 
alization with high-transaction SQL Server systems, but 
a good percentage of the audience disagreed: Several 
attendees indicated that they were already running their 
production SQL Server solutions in virtualized environ- 
ments. Although most of these attendees seemed to come 
from small-to-midsized business (SMB) organizations, it 
was clear that virtualization would continue to have an 
impact on how SQL Server is deployed and used. 

The panelists remained more conservative in their 
approach to virtualization, suggesting that the modest 
performance hit involved in virtualizing SQL Server still 
made them reluctant to endorse its use on live, production, 
and critical SQL Server installations. 

“Note that this is the answer 
that were giving this year,” said 
panelist John Paul Cook. “Come 
back to PASS next year and 
some of our answers may have 
changed.” SOU 

—Jeff James 
InstantDoc 10 37215. 


he SQL Server community is highly engaged and pas- 

sionate about honing its skills and catching up on the 
latest developments and products. I experienced this lively 
community at this year's Professional Association of SOL 
Server (PASS; see www.sglpass.org) Community Summit 
in Denver. Roughly 2,500 SQL Server developers, admin- 
Istrators, and business intelligence (BI) specialists attended 
PASS, along with more than three-dozen vendors who 
exhibited their products at the conference's exhibition. 
Although attendance at PASS has waxed and waned in 
recent years, this year's event was well populated with ses- 
sions, exhibitors, and attendees. I had the opportunity to 
talk with representatives of Microsoft and other vendors, 
examine new products on the expo floor, and meet some 
of the attendees and speakers. 


SQL Nitro Touts 
Data Compression 
Microsofts presence at the 
show was significant, including 
a massive booth and a large, 
bus-sized vehicle. Microsoft 
representatives were also fea- 
tured in many breakout ses- 
sions and talks throughout 
the event, underscoring Micro- 
soft's support for PASS and the 
SQL Server customer community. 

In the expo hall, SQL Server utility provider DBA 24 
Hours was demonstrating DBLaunch 2.5 and SQLNitro 
2.0. DBALaunch lets DBAs automate and standardize 
deployment and maintenance activities SOLNitro opti- 
mizes data on the network going into and out of SQL 
Server databases. 

According to DBA 24 Hours Product Manager Bob 
Boule, SQLNitro manages to achieve those goals by opti- 
mizing the Tabular Data Stream (TDS) protocol. *The 
native network protocol may transfer one packet of 4096 
bytes into one TCP buffer,” explains Boule. “SQLNitro 
can pack more than a dozen packets in a single buffer, 
which greatly reduces files sizes." 

Boule also noted that SQLNitro relies on an adaptive 
compression technology that can sense whether compres- 
sion would be usable in each instance. “SQLNitro will 
tune itself when it can make a positive change, but it's also 


Bob Boule 
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PolyServe Database Utility 


for SQL Server 


A unique approach to consolidation and availability 


are just aliases for physical instances on the host Windows 
OS, so they don’t require their own virtualized OS, as is the 
case with a VM. 

To implement PolyServer, you begin by selecting as 
many as 16 SAN-connected servers that will belong to 
the PolyServe cluster (aka matrix). (The servers are run- 
ning Windows Server, but aren’t clustered using Windows 
Clustering.) Installing PolyServe puts a replaceable file 
system driver called PolyServe File System (PSFS) and the 
PolyServe service 
on each. server. ORE on the WEB 
HE Provides um Read the full-length 
engineer to assist version of this review at 
with designing and InstantDoc ID 97338. 
installing it into 
your environment. 

To create a clustered file system on the SAN, Poly- 
Serve uses its NTFS-compatible PSFS driver to format 
the SAN LUNs into volumes that can be shared by one 
or more servers in the matrix. PSFS maintains Windows- 
compatible locking and concurrency on the files by using 
PolyServe's Distributed Lock Manager. The result is that 
you can mount a particular SAN LUN as drive G (eg., 
for backups) on several or even all database servers in the 
matrix, something you can't do using standard Windows 
Server file drivers. 

After installing a physical SQL Server instance, you 
use the PolyServe management utility to assign a virtual 
IP address and virtual SQL Server instance name to the 
physical instance. Using PolyServe's Dynamic Re-Hosting 
features, you can then install failover target instances for 
each virtualized SQL Server instance. 

PolyServe's failover for SQL Server instances differs 
from Windows failover clustering: The database servers 
need not be all from the same hardware vendor, and they 
can have different CPU and memory configurations. 

In my own experience, HP PolyServe Database Utility 
for SQL Server is most useful for managing SQL Server 
consolidated instances across several database servers, 
with the added benefit of failover clustering for those 
mission-critical instances requiring high availability. Man- 
agement of the instances is very intuitive, making both the 
current locations of all instances, as well as their potential 
failover targets, visible at a glance. ЕЙ 

InstantDoc ID 97338 
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extends the host OS 


Ron Talm age recently tested HP's PolyServe Database Utility for SQL 
(rtalmage@ prospice.com) owns Prospice, a Server, which takes a unique approach to SQL Server 
database-consulting firm. He is president of consolidation and availability. HP acquired PolyServe in 

the Pacific Northwest SQL Server Uses Group, February 2007, integrating it into ће HP StorageWorks 
is an MCSD, an MCP and a SQL Server MVP. 2. . . : 
He is the author of Microsoft SQL Server 7 Division. The new HP version includes major enhance- 
Administrators Guide (Prima Tech, 1999). ments such as support for recovery of a PolyServe cluster 
at a remote site. Although it extends HP SAN management 
options, PolyServe also works with other brands of servers 
and SAN hardware. PolyServe supports several OSs, 
including Windows Server 2003 R2 64-bit and Windows 
2003 or later 32-bit and Linux, as well as server applica- 
tions such as SQL Server and Oracle. My review focuses 
on version 3.6 enhancements for SQL Server on Windows 
Server. 

PolyServe is a software solution for managing both 
SQL Server consolidation and high availability. Using 
PolyServe, you can easily move SQL Server instances from 
server to server within a “matrix” of as many as 16 servers 
that are all connected to a common SAN. 

PolyServe' approach to SQL Server consolidation and 
availability is unique because, unlike traditional virtualiza- 
tion software, PolyServe doesn’t create a virtual machine 
(VM) with its own OS and file storage. Instead, PolyServe 
extends the native Windows Server host OS by virtualizing 
SQL Server instance names on the host Windows OS and 
by making storage volumes on a SAN accessible to multiple 
servers in the matrix. The virtualized SQL Server instances 


POLYSERVE DATABASE 
UTILITY FOR SQL SERVER 


Pros: Simplified consolidation and failover for SQL Server based on a 
clustered file system and SQL Server instance virtualization; SQL Server 
instance failover is easier and more flexible than Windows clustering; recovery 
of a PolyServe cluster at a remote site is now supported 


Cons: Installation requires careful planning and a support engineer to assist with 
design and installation; failover support for SSAS is scheduled for a future release 


Rating: ۲۲ ۲ 
Price: $6,800 per CPU socket 


Recommendation: For enterprise and mid-size organizations, | recommend this 
product for SQL Server consolidation and high availability. 


Contact: HP • 877-476-5973 • www. polyserve.com 
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NEW PRODUCTS 


HARDWARE 


Preconfigured Hardware Platforms for Business Intelligence 

Microsoft and Dell have joined forces to create new Business Intelligence (BI) and Data Warehousing reference platforms 
that include preconfigured sets of software, storage, and servers designed for BI and data warehousing applications. 
Three versions of the reference configuration platform are offered, differentiated by disk size. 

Each of the reference configurations 
ships with a database server, an analysis 
server, and a reporting server, all based on 
Dell PowerEdge hardware running SQL 
Server 2005 and Windows Server 2003. 
Buyers then choose a 1TB, 2TB, or 4TB 
storage option, with appropriate storage 
provided by Dell PowerVault MD1000 
storage arrays. 

Microsoft General Manager of SQL 
Server Tom Casey mentioned in an inter- 
view that these configurations should help 
customers take the guesswork out of assembling an optimal platform for BI. “These new configurations 
don’t require customers to learn new skills to solve their BI problems,” said Casey. “This makes BI easier to 
deploy and will speed up initial adoption of BI capability.” 


All three configurations are available now. For more information, contact Dell at 877-671-3355 or visit SOL Server 
www.dell.com/sqlbi. аа. 
Sams Teach Yourself 
TRAINING Microsoft 


Learn SQL Server T-SQL in 10 Minutes SQL Server 


SAMS Teach Yourself Microsoft SOL Server T-SOL in 10 Minutes is a new book from SAMS Publishing T- SQL 


Got a great new product? 
Send announcements to products@ sqlmag.com 
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that promises to give DBAs the information they need to get started using the T-SQL language. Written by 
Ben Forta, the book explains how to create T-SQL statements that can be used to filter data, join tables, 
and automate processes with triggers. SAMS Teach Yourself Microsoft SQL Server T-SQL in 10 Minutes 
is available now and retails for $19.99. For more information, contact SAMS at 201-767-5021 or visit 
www.informit.com. 


DATABASE MIGRATION 


Migrate Data from Microsoft Access to SQL Server 
DMSoft Technologies has announced DBConvert 2.0, an 
р Customization — updated version of its database migration tool. DBConvert is 
- - سے‎ used to convert data to and from Microsoft Access and SQL 
— | || Server database file formats. The software also lets DBAs 
з apply customizable data filters that specify 
lame ee stesse ӘУ | migration criteria, and a command-line option 
gives more precise control over the migration 
process. Pricing for DBconvert begins at $69. 
For more information, contact DMSoft Tech- 


q uv 777۳ 


nologies at esupport@dbconvert.com or visit‏ | کے eee‏ ی سس 
Loud ay Be „лнй dnd e‏ 
NINE IEEE‏ 

www.dbconvert.com. 


SQL Server Magazine * www.sqlmag.com 


ORACLE DATABASE EXTRACTION " 
Optimize Data Extraction from Oracle Databases FA( - te 
Innovative Routines International Inc. (IRI) has shipped a Windows 

version of CoSort Fast Extract (FACT), a tool that lets DBAs transfer CoSORT FAst extraCT 
Oracle tables to external files. According to IRI, the product can unload 

large Oracle tables to flat files in a variety of formats. FACT can also be used in conjunction with CoSort’s SortCL and 


Oracle’s SQL*Loader tools, a combination that IRI claims will improve high-volume migration and replication opera- 
tions. For more information, contact Innovative Routines International at 321-777-8889 or visit www.cosort.com. 


DATABASE UTILITIES 


Manage Database Operations 

Red Gate Software has released SQL Data Compare 6, a new version of its database comparison product. 
The update introduces Row Level Restore, which lets DBAs repair corrupted or missing data and restore it 
to its original state. SQL Compare 6 also allows comparison between live and backup databases and between 
two offline backup files. Pricing for SQL Data Compare 6 begins at $395. For more information, contact Red 
Gate Software at 866-733-4283 or visit www.red-gate.com. 


DATABASE MANAGEMENT 


Optimize Database Performance 

UpTempo from Datacore Software is a new product designed to boost the performance of Microsoft Windows 
Server platforms and the applications that run on them. Datacore claims that the software works by making 
the servers read and write data to fixed-disk storage more efficiently, and by eliminating I/O bottlenecks on the network. 
Pricing for UpTempo ranges from $49 for Windows XP systems to $498 for Windows 2003 Server systems. For more 
information, contact Datacore at 954-377-6000 or visit www.datacore.com. 


DEVELOPMENT 

Create Database Applications for — Sn 
REAL Basic بسن سس‎ ium | auis MONDE аст) 

REAL Software has announced REAL SQL Server, а Re 
multi-user relational database for REALbasic applica- مج وید‎ TAS) 

tions. REAL SQL Server can support hundreds of Ei RSE OE RE Gp 00 
simultaneous users (although 50 or less are recom- —ÀÜ 

mended), and REAL Software claims that migrating Г Om pt Qo тн dtu о 

data from the REAL SQL Database to REAL SQL = pum 

Server is made easier because both applications are 

based on the same database engine. REAL SQL Server is 

available now and is priced at $500 per server. For more 

information, contact REAL Software at 512-328-7325 — — 


or visit www.realsoftware.com. - نیسح ڪڪ‎ 


STORAGE 


Improve Data Acquisition and Warehousing 
Texas Memory Systems’ new RamSan-500 provides high read performance and large-capacity storage. The RamSan-500 
is the first enterprise-class cached flash storage system, as well as the fastest flash-based solid state disk (SSD) currently 
available. Because the RamSan-500 uses flash technology in addition to as 
much as 64GB of Double Data Rate (DDR) RAM, its capacity 
is larger (2TB in a 4U chassis) and its price is lower than SSDs 
based strictly on DDR RAM. The RamSan-500 makes super- 
high-performance SSDs more accessible for mainstream applica- 
tions, such as those based on Microsoft SQL Server or Oracle 
databases. For more information, contact Texas Memory Systems 
at 713-266-3200 or visit www.texasmemorysystems.com. EM 
InstantDoc ID 97323 
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“The ability to maintain 
clean, reliable marketing 
data across multiple capture 
mediums is essential for 
everything we do.” 


- Kevin Parker, Digital Evergreen 


A better way to 
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Address Object API 


Verify, cleanse and format 
customer data at the point of 
entry or in batch. Easily 
integrate with .Net, MS SQL 
or Java. 


Get a demo: 
1-800-MELISSA 
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Microsoft SQL Server 2005 20 CD-ROMs or 2 DVD-ROMs 


Microsoft SQL Server 2005 Reporting Services (SSRS) 8 CD-ROMs or 1 DVD-ROM 
Microsoft SQL Server 2005 Analysis Services (SSAS) 8 CD-ROMs or 1 DVD-ROM 


Microsoft SQL Server 2005 Integration Services (SSIS) 8 CD-ROMs or 1 DVD-ROM 


BUY 1, GET 1 


For a limited time, purchase one of our SOL Server 2005 courses above, get another SOL Server 2005 course 
(of equal or lesser value) FREE! Or get all four SOL Server 2005 courses in one money-saving learning suite! 


Visit our Web site today for offer details, plus course outlines and 
more information about our new SQL Server 2005 courses. 
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SQL Server Virtualization FAQs 


What types of SQL Server implementations are best suited 
to virtualization? 

Because virtualization adds overhead, the raw performance 
of a VM won't match that of a physical implementation. 
Virtualize smaller or departmental SQL Server workloads 
where there is excess CPU and I/O capacity. Be wary of 
virtualizing database workloads that have high resource 
requirements. However, many enterprise databases can be 
virtualized and still meet their service level agreements. It 
just takes more planning. 


Does virtualization make SQL Server administration 
harder or easier? 

Virtualization is not a panacea for server administration. 
Although they're on a common hardware platform, each 
virtual server still has the same management require- 
ments as a physical server—services need to be started 
and stopped, queries need to be tuned, databases need 
to be backed up. You can still use management tools 
such as Enterprise Manager and SQL Server Manage- 
ment Studio just as you would to manage a standard 
physical server. 


SQL Server is licensed per processor—how does that 
work if you're running multiple SOL Server VMs on an 
8-way host? 
Microsoft licenses SQL Server per virtual CPU, not by 
the number of host CPUs. If you have a SQL Server 
VM that uses one virtual CPU and the VM runs on an 
8-way host, then you only need one CPU license for SQL 
Server. Microsoft allows an unlimited number of virtual 
instances for SQL Server 2005 Enterprise Edition, with no 
additional licensing costs. |500 
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M is one of the hottest trends, especially 
to combat server sprawl and increase operational 
efficiencies, and it can reduce TCO. However, database 
workloads like SQL Server can be some of the most 
troublesome to virtualize. Here are commonly asked ques- 
tions about virtualizing SQL Server. 


Is virtualization the only way to consolidate servers? 
Although virtualization is one of the best ways to perform 
server consolidation, it's not the only way. For instance, 
you can consolidate database servers simply by adding 
additional databases and their workloads onto a single 
server. In addition, you can run multiple instances of SQL 
Server on the same system. These are examples of how to 
consolidate your workload without virtualization. 


How does virtualization help with server consolidation and 
operations in general? 

First, because virtualization technology abstracts the 
server or OS layer from the underlying hardware, it's easier 
to move a server to different hardware for better scal- 
ability or for disaster recovery. Next, not all workloads or 
applications are compatible with one another and server 
consolidation using virtualization enables you to keep all 
your databases on completely separate servers each run- 
ning under their own OS. 


How much overhead does virtualization impose? 

The actual overhead depends primarily on the virtualiza- 
tion infrastructure and the I/O characteristics of the work- 
load that's running. However, a general rule of thumb is to 
expect a 10 to 15 percent reduction in performance between 
a physical machine and a virtual machine (VM). Obviously, 
if you're running in a server consolidation environment, 
other active VMs will make an impact on overhead, too. 
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THE ULTIMATE 
BACKUP & RECOVERY SOLUTION: 


SOL safe v4.5 


THE ULTIMATE DIAGNOSTICS & 
PERFORMANCE MONITORING SOLUTION: 


SOL diagnostic manager v5.0 
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Try Idera’s new SOL Server management solutions for yourself free for 14 days. Download everything 
you need for the SOL Server ultimate management solution at: 


